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1. Introduction 

The Plateau’s problem investigates those surfaces of least area spanning a given contour. 
It is one of the most classical problems in the calculus of variations, it lies at the crossroad 
of several branches of mathematics and it has generated a large amount of mathematical 
theory in the last one hundred years. The problem itself and its various generalizations 
have found fundamental applications in several mathematical and scientihc branches. Since 
it is a prototype of a vast family of questions with geometric and physical signihcance, the 
techniques developed to analyze it have proved to be very useful in a variety of other 
situations. 

Although its original formulation is restricted to 2-dimensional surfaces spanning a given 
curve 7 in the 3-dimensional space, in this note we will consider the case of (suitably 
generalized) m-dimensional surfaces in oriented Riemannian manifolds S. We will restrict 
ourselves to compact (sufficiently smooth) S and since all the considerations will be of a 
local nature we will often assume that S itself is isometrically embedded in some euclidean 
space (of dimension m -|- n). In this way the competitor surfaces (classical or generalized) 
spanning the contour 7 will always be (suitable generalizations of) subsets of the standard 
euclidean space, constrained to be subsets of S. Although this is not very elegant from a 
geometric point of view, it allows us to avoid a lot of technicalities. 

The very formulation of the Plateau’s problem has proved to be a quite challenging math¬ 
ematical question. In particular, how general are the surfaces that one should consider? 
What is the correct concept of “spanning” and the correct concept of “m-dimensional vol¬ 
ume” that one should use? The author believes that there are no hnal answers to these 
two questions: many different signihcant ones have been given in the history of our subject 
and, depending upon the context, the features of one formulation might be considered 
more important than those of the others. 

One popular way to generalize the concept of surface is to consider it in duality with 
differential forms: on a smooth m-dimensional oriented submanifold of S we can integrate 
compactly supported forms and this action gives a natural relation between smooth sub¬ 
manifolds and linear maps on the space of smooth (compactly supported) forms P™^(S). 
Following a pioneering idea of De Rham (cf. [36]) we introduce the m-dimensional cur¬ 
rents^ which are linear functionals on D™'(S) satisfying a suitable continuity property. The 
action of a current T on a form uj is then given by T{u:) and we can introduce naturally 
a concept of boundary “enforcing” Stokes’ theorem: dT{uj) := T{du). In fact currents 
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can be thought as an extension of the classical notion of chain, namely of formal linear 
combinations of submanifolds, and the space of currents with the operator d forms a chain 
complex: this in turn leads to an homology theory which can be shown to be equivalent 
to the several other (classical) homology theories. There is also a natural generalization of 
the concept of m-dimensional volume and using standard functional analytic methods it 
is then possible to show the existence of minimizers for the Plateau’s problem. As a nice 
byproduct of this theory we can represent each class in (real) homology with a cycle of 
least mass. 

Note however that De Rham’s theory allows real multiplicities. This is not natural from 
the point of view of Plateau’s problem (in fact generalized minimizers might have non¬ 
integer multiplicities, cf. Example 2.8 below). We could try to remedy to this shortcome 
by considering only the subspace of currents which can be represented as classical chains 
with integer multiplicities and then look at its closure in the appropriate topology. This 
point of view was taken in the celebrated paper [41] by Federer and Fleming and the 
corresponding objects, called integral currents, provide an ideal framework for the general 
oriented Plateau’s problem. A very nice feature of the Federer and Fleming theory is that 
each integral homology class can be represented by a cycle of least area. In fact the theory 
is powerful enough to deal with more general coefficient groups. 

There are two hrst fundamental issues that a satisfactory variational theory needs to 
address: existence and regularity. Of course these are by no means the only important 
aspects of Plateau’s problem: however almost all the other necessarily build on these two 
important pieces of information, namely that 

(a) there is a minimizer for a large class of boundaries; 

(b) the minimizer is sufficiently regular, so that one can compute interesting geometric 
quantities and infer additional conclusions. 

The success of the Federer and Fleming’s theory is due to the vast applicability of its exis¬ 
tence part in all dimensions and codimensions. Thanks to the efforts of several outstanding 
mathematicians a rather far-reaching (and satisfactory) regularity theory was achieved in 
the seventies in codimension 1 (see for instance [46]). This theory has been digested by the 
subsequent generations of scholars working in differential geometry and PDFs, leading ulti¬ 
mately to many breakthroughs in different problems in geometry, PDFs and mathematical 
physics. Indeed the codimension 1 case is considerably easier than the higher codimension: 
the reason is that integral m-dimensional cycles in are in fact (countable integral 

combinations of) boundaries of sets. The Federer-Fleming theory is thus equivalent to the 
theory of sets of hnite perimeter (or Caccioppoli sets), which was indeed introduced a few 
years earlier by De Giorgi, developing a pioneering idea of Caccioppoli (cf. [12, 18, 19]; for 
an english translation of De Giorgi’s works we refer to [22]). 

In the higher codimension case the most important conclusion of the regularity theory 
can be attributed to the monumental work of a single person, F. J. Almgren Jr. ([5]). 
Unlike the codimension one case, only a relatively small portion of the theory of Almgren 
has been truly understood. In a recent series of papers Emanuele Spadaro and the author 
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have given a new, much shorter, account of Almgren’s regularity program, relying on the 
several advances in geometric measure theory of the last two decades and on some new 
ideas. The aim of this note is to give a rather detailed picture of the several issues that 
this program must face and of how they are resolved. 

We will assume that the reader is familiar with the basic concepts of the theory of 
integer rectihable currents: we will therefore only recall them very quickly to lay a common 
notational ground (the reader who is not familiar with this part of geometric measure theory 
is instead encouraged to read hrst the survey [24]). As part of our exposition we will isolate 
the features of codimension 1. In Section 3 we will recall the hrst considerations in the 
regularity theory and summarize the state of the art in the subject. We will then review the 
regularity theory for minimizers in codimension 1, focusing on its most important steps, 
cf. Section 4. The reader who is very familiar with these topics can skip the corresponding 
sections. However some of the discussions in there will play an important role in the rest of 
the note, where we will describe the details of the proofs of Almgren’s theorem as presented 
in the papers [26, 27, 30, 28, 29]. 

2. Terminology and fundamental results in the theory oe currents 

2.1. General currents. The idea of treating (oriented) surfaces as linear functionals on a 
suitable space of smooth objects dates back at least to De Rham, cf. [36]. In what follows 
we denote by D™(f2) the space of smooth compactly supported m-forms in where is 
a differentiable manifold. We will omit H when it is clear from the context. 

Definition 2.1 (Current, De Rham, cf. [58, Dehnition 26.1]). An m-dimensional current 
T is a continuous linear map T : I1™(D) —)■ R. Here the continuity condition must be 
understood in the following sense: T{u^) —)■ T{u) whenever {u^} C D™' is a sequence such 
that 

(a) there is an open set D' CC D with spt(a;^) C D' for every k] 

(b) oj^ ^ u in C^{VL') for every j. 

A sequence of currents (of the same dimension m) converges to T if T^{u) T{u) for 
every u G D™ (cf. [58, eq. 26.12]). 

As already mentioned the concept of boundary is dehned “enforcing” Stokes’ Theorem. 

Definition 2.2 (Boundary, De Rham, cf. [58, eq. 26.3]). We say that an (m — 1)- 
dimensional current S is the boundary of an m-dimensional current T if 

T{duj) = S{u) for every u G (1) 

S will then be denoted by dT. 

The class of smooth oriented submanifolds T C D with smooth boundaries can then 
naturally be viewed as a subset of the space of m-dimensional currents in D. In order 
to distinguish between any such T and its “action” as linear functional via integration of 
forms, we will use the notation [T] for the current, namely [T] (ca) := JpO; (cf. [58, eq. 
26.2]). If we consider points P G D as 0-dimensional submanifolds, consistently with our 
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convention we will denote by |P] the usual Dirac delta at P, although a more common 
symbol would be 5p. 

Although the dehnition above is quite general, in the rest of this note we will always 
assume that D is an open subset of the euclidean space If S is then a submanifold 

of we will consider currents T in S as currents T in whose support spt(T) is 

contained in S. As usual the support spt(T) is the complement of the maximal open set 
U for which T{uj) = 0 whenever spt(a;) C U (cf. [39, Section 4.1]). 

Recall that a simple m-vector is an element of of the form ui A ... A Vm- 

Moreover there is a natural dehnition of length of a simple m-vector: |ni A ... A Uml is the 
m-dimensional Hausdorff measure of the parallelogram spanned by the vectors ui,..., Vm- 
I • I can be easily extended to a Hilbert norm on Am(R™''''”) and the corresponding scalar 
product takes the following form when computed on simple vectors: 

(ui A ... A Vm, Wi A ... A Wm) = det (n* • Wj). 

Definition 2.3 (Comass and mass, cf. [39, Section 1.8]). Let u E A™'(M™'+"^). Then the 
comass of u is the norm 

||a;|| := max {(a;, Ui A ... A Vm) : • d' '^ml = 1} , 

(here (,) is the standard duality pairing between covectors and vectors). The dual norm 
on Am(R™^"'), called the mass norm, is 

||T|| = max |(a;, T) : ||a;|| < l| . 

If ca G D™'(M™'+"') we then dehne its comass norm to be 

||a;||c := max ||a.;(p)|| . 

p 

Definition 2.4 (Mass of a current, cf. [39, Section 4.1.7]). Given an m-dimensional current 
T we denote by M(T) its mass, namely the quantity 

M(T) = sup {T{u) : [Italic < 1} • (2) 

For any open set D we dehne 

||T||(r2) = sup{T(ti;) : spt(a;) C D and ||ta||c < 1} . (3) 

When T is a current of hnite mass, namely M(T) = ||T|| (M™'”''"^) < oo, ||T|| turns out 
to be a Radon measure (cf. [39, Sections 4.1.5 and 4.1.7]). The same holds under the 
weaker assumption that ||T||(r2) < oo for any bounded open set D: in this case we will 
say that T has locally hnite mass. If we endow Am(R™'’''"^) with the mass norm, T can 
then be seen as a measure taking values in a (hnite-dimensional) Banach space and ||T|| is 
the total variation measure of T. It is customary to take the “polar (or Radon-Nikodym) 
decomposition” (cf. [58, eq. 26.7 and Dehnition 27.1]) of T and infer the existence of a 
Borel-valued map T such that T = T||T||, namely 

m) = j{oj{p),f(p))d\\T\\{p) Vo. e ©“(R"). 


(4) 
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The comass of T{x) equals 1 at every point x. In particular at those points where T{x) is 
simple, it can be represented as Ci A ... A for a set Ci, ... , of vectors which span a 
parallelogramm of volume 1. 

If F is a smooth proper map between two Euclidean spaces and the pullback 
F'^u of an element u G is an element of By duality this gives naturally 

a notion of pushforward of currents, namely F^T{u) = T{F'^u) (cf. [58, eq. 26.20]). The 
assumption that F is proper is needed to guarantee that F'^u has compact support if u 
has compact support. This is however not needed when spt(T) is compact: since this will 
be mostly the case, in several occasions we will push currents forward via maps which are 
just smooth. 

Moreover, when hi C hi' are open and T is a current on hi' we can easily restrict its 
action to forms supported in hi: for such restriction we use the notation TLhI. When T 
has hnite mass, we can define TlF for any Borel set E and consider it as a current in the 
original domain of dehnition by simply setting 

TlE{u):= [ {u{p),f{p))d\\T\\{p). 

In the future, to make our discussion simple, we will say that two currents T and S agree 
on an open subset hi if TLhI = S'Lhi. If the currents have (locally) hnite mass, this 
notion can be extended to Borel subsets. We use a similar notation on Borel measures p 
(which we always assume to be dehned on some subset K of the Euclidean space): given 
a /i-measurable E C K, we use the symbol fxLE for the measure p,LE{A) := fj,(A fl E). 

The product of an m-dimensional current T in and an ?fi- dimensional current S in 
R^ is dehned on forms of type u A u with u G ©’"(R^) and z/ G by 

Tx S{ujAiy) :=T{uj)S{u). 

The action is then extended by linearity to general forms in 
If S and T are two currents in R^ we dehne their join S'x( T as 

//s(|[0 ,l]]x^xT) 

where H : R X R^ X 3 (t, x,y) ep- y [I — t)x. Of particular interest for us is the case 
where S is |P] for some point P\ the join |P] x( T is then the cone with vertex P and 
basis T and to simplify the notation we will use Px( T. 

2.2. The Federer-Fleming theory. As already mentioned, general currents allow to 
take real linear combinations of classical snrfaces. Following Federer-Fleming it is instead 
possible to bnild an homology theory with integer coefficients. 

Definition 2.5 (Integral currents, Federer-Fleming, cf. [39, Dehnition 27.1]). A current 
T of locally hnite mass is integer rectifiable if there is a sequence of oriented surfaces 
Sj C R”^+"', a sequence of pairwise disjoint closed subsets Pj C Sj and a sequence of 
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positive integers ki such that 

T{u) = ^ki [ u ^ueV”^. (5) 

T is integral if both T and dT are integer rectihable. 

In the dehnition above it is implicitely assumed that the right hand side of (5) is a 
convergent series. In fact it is not difficult to see that, under the above assumptions, 
ll^ll(^) = kiW^{Ki n ff) = /cjVol™ {Ki n fl), where 'H°‘ denotes the a-dimensional 
Hausdorff measure. 

The space of integer rectihable currents is not any more a linear spaee and there is 
no simple functional-analytic principle which provides a good compactness property. A 
fundamental result in the theory of Federer-Fleming is that, nonetheless, the space of 
integral currents is compact in a suitable sense. 

Theorem 2.6 (Compactness of integral currents, Federer-Fleming, cf. [58, Theorem 32.2]). 
If {T^} is a sequence of integral m-dimensional currents in Q G open such that 

sup(M(T^) + M(9T^)) < oo, 

k 

then there is a subsequence, not relabeled, and an integral m-dimensional current T such 
that Tk ^ T. 

As a corollary we achieve 

Corollary 2.7 (Existence of area minimizing integral currents). Let Z be an {m — 1)- 
dimensional integer rectifiable current and T an m-dimensional integral current with dT = 
Z and M(T) < oo. Then there is an integer rectifiable current Tq such that OTq = Z and 

M(To) = min{M(T) : T is int. rect. and dT = Z} . 

//spt(T) C K for some closed set K we also have the existence of an integer rectifiable 
current Tq such that dT^ = Z, spt(To) C K and 

M(To) = min{M(T) : T is int. rect., dT = Z and spt(T) C K} . 

Indeed both Theorem 2.6 and Corollary 2.7 can be extended to a fairly large class of 
metric spaces, cf. [9]. 

Of course a similar corollary can be drawn via classical functional analysis for general 
currents: if Z is a general m—1-dimensional current, K a closed set and T an m-dimensional 
current of hnite mass with dT = Z and spt(T) C K, then there is an m-dimensional current 
To with (9To = Z, spt(To) C K and 

M(To) = min{M(T) : dT = Z and spt(T) C K} . 

However the latter formulation allows for general real coefficients. 

Example 2 . 8 . Consider the south and north poles S and N in the standard sphere C 
and let Z be the 0-dimensional current |A^] — [S']. For any meridian 7 joining S to N the 
corresponding current [7] is a minimizer of the mass among all currents T with dT = Z 
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and spt(T) C However the same holds for any convex combination A [ 7 ] + (1 — A) |? 7 ] 
where rj is any other meridian and A G [0,1]. In fact one can push this idea even further. 
Let us parametrize the meridians as {ytjtesG where t is the intersection of 74 with the 
equator {x^ = 0} fl If /i is a probability measure on then the current 

To{u):= [ |7t] (w) d/i(t) 

is also a minimizer of the mass (among those currents T with spt(T) C and dT = Z). 

This does not seem a serious issue as there are anyway “classical minimizers” in the 
example above. However, we have the following remarkable theorem (for a very short and 
elegant proof we refer to [ 68 ]). 

Theorem 2.9 (Lavrentiev gap, Young [72]). For every smooth elosed embedded eurve 7 
in define 

M{'y) := inf {M(T) : T is integer reetifiable and dT = I 7 ]} ( 6 ) 

m(7) ;= min{M(T) : 5T = I7]} . (7) 

Then there are 7 b for which M{'y) > 777(7). 

Theorem 2.6 does not exhaust the major results of the foundational paper of Federer 
and Fleming. Indeed we wish to mention three other important cornerstones. First of all, 
the rectihability of the boundary can be recovered from that of the current under the only 
assumption that the boundary has hnite mass. 

Theorem 2.10 (Boundary rectihability, Federer-Fleming, cf. [58, Theorem 30.3]). IfT is 
integer rectifiable and dT has locally finite mass, then T is integral. 

Secondly, any integral current can be suitably approximated by a sequence of “poly¬ 
hedral chains” with integer coefficients. This is the content of the so-called Deformation 
Lemma (see [58, Theorem 29.1 and Corollary 29.3]), whose precise statement would require 
the introduction of some terminology and goes beyond the scopes of this note. The De¬ 
formation Lemma shows that the space of integeral currents is the closure of the space of 
classical chains with integer coefficients in a topology which is very natural for the Plateau’s 
problem. 

We record here a rather useful corollary of the Deformation Lemma. 

Theorem 2.11 (Isoperimetric inequality, Federer-Fleming, cf. [58, Theorem 30.1]). There 
are constants C{m,n) with the following property. Assume S is an integer rectifiable m- 
dimensional current in with dS = 0. Then there is an integral current T with dT = S 
and M(T) < C'(M(^))(”^+hM, 

We hnish this section by introducing a couple of other objects which will be very conve¬ 
nient in the rest of the note. 


Lemma 2.12 (Density). IfT is an integer rectifiable current, then the number 


e(T,p) 


ii„ Jir||(B,(p)) 

r4.0 


( 8 ) 
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exists and it is a positive integer for ||T||-a.e. p. 

As for the density we introduce the notion of tangent planes via a “blow-up procedure”. 
In order to give the corresponding statement we introduce two conventions: 

• First of all, we will consider all m-dimensional planes tt as oriented. Thus, for each 
71 we have a unique integral current [tt]. 

• Given a current T we will denote by Tp ,, the result of translating it so that p 

becomes the origin and enlarging it of a factor r~^. Formally, if ip^r denotes the 
map a; i-A {x — p)/r, then Tp^r ■= Note that when T = |F] for some smooth 

surface F then Tp j. = |<-p,r(r)]. 

Lemma 2.13 (Canonical representation of integer rectihable currents). IfT is an integer 
reetifiable eurrent, then for ||T||-a.e. p there is a unigue plane 7i{p) sueh that 

-t 0(T,p) |7r(p)] asr|0. (9) 

7i{p) will then be ealled the tangent plane to T at p. If the sets Ki and the submanifolds S* 
are as in Definition 2.5, then 7i{p) = TpY^i (the elassieal oriented tangent to Sj at p) for 
W^-a-e. p G Ki. Moreover, i/T||T|| is the Radon-Nykodim deeomposition ofT, then we 
ean find m Borel veetor fields Ci,... with the properties that 

• ei(p),... ,em{p) is a (positively oriented) orthonormal basis of 7i{p) for ||T||-a.e. p; 

• f{p) = ei(p) A ... A Cmip) for \\T\\-a.e. p. 

The outcome of Lemma 2.12 and Lemma 2.13 is that 

T = TQH^LR, 

where: 

• 0 is integer valued, 

• is a countably m-rectihable set (namely the union, up to an "H^negligible set, of 
countably many compact subsets of submanifolds) 

• and T is an orientation for the tangent space to R at 'H'^-a.e. point. 


2.3. The codimension 1 case: sets of finite perimeter. Integral currents of codimen¬ 
sion 1 have a special feature: they can be seen, locally, as boundaries of integral currents 
of “top dimension”. By dehnition, integer rectihable currents T of dimension m + n are 
represented by ^ ki li?*], where the Efs are pairwise disjoint closed subsets of the 

action of iF'j] on a “top form” fdxi A ... A dxm+n is then given by the standard Lebesgue 
integral 

{Eij {fdxi A ... A dxra+n) = f (x) dx . 

JEi 

Definition 2.14 (Caccioppoli sets, De Giorgi, cf. [8, Dehnition 3.35]). A measurable set 
E C with hnite Lebesgue measure is a set of hnite perimeter if its indicator function 

Is is a function of bounded variation, namely if 


divX 


X e G“(R™+^R”^+^) and ||X||co < 1 


P{E) := sup 


< CXD . 
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P(i?) is called the perimeter of E. 

Like the mass, the perimeter can be localized to dehne a Radon measure. In fact such 
measure coincides with the total variation of the distributional derivative DIe, which is 
usually denoted by \D1e\ (cf. [ 8 , Theorem 3.3.6]). If i? is a smooth set and hi is an open 
set, then ||Zi)l£;|| (hi) is the m-dimensional volume of that portion of dE which lies in hi. For 
this reason it is customary to use the notation P{E, hi) for the relative perimeter HT^IeII (^) 
when n is an open set and E is a Caccioppoli set (again cf. [ 8 , Dehnition 3.3.5]). 

The fundamental link between the two theories is then given by the following 

Proposition 2.15 (Currents and sets of hnite perimeter, cf. [58, Theorem 27.6 and 
Corollary 27.8]). Let E be a measurable subset o/R™'+^ with finite Lebesgue measure. |i7] 
is then an integral current if and only if E is a Caccioppoli set. Moreover, in this case 
M{dlEl)=P{E). 

Consider next an m-dimensional integer rectifiable current T with finite mass in 
and let Bp(a;) C be such that spt((9T) fl Pp{x) = 0. Then there are countably many 

sets of finite perimeter Ei and positive integer ki such that 

(i) ||T||(B,(a;)) = E*^*P(^.B,(a;)); 

(ii) T = Y, kid {Eij on Pp{x). 

Theorem 2.6, Corollary 2.7, Theorem 2.10 and Theorem 2.11 are all generalizations of 
theorems proved by De Giorgi for sets of hnite perimeter (see [ 8 , Sections 3.3 and 3.5]). 

2.4. Slicing. The hnal tool which we recall in this section is the slicing theory of currents. 

Proposition 2.16 (Slicing, cf. [39, Section 4.3]). Let T be a current of dimension m in 
U C R™'+"^ with M(T) + M((9T) < cxd. Consider tt : U —)■ R^ Lipschitz with k < m and 
the m — k-dimensional current TLdn defined by 

TLdn{u) := T{dni A ... A duk A u) . 

Then there exists a map R^ 3 11 —)■ {T,7i,t) taking values in the space of m — k-dimensional 
currents which is (weakly) measurable and satisfies the following properties: 

• {T,n,t) is supported in Tr^^df}); 

• TLdn{u) = f (T, tt, t}(Lv) dt for every test form u; 

• ||TL(i 7 r|| = / ||(T, TT, t)|| dt (and thus {T,7i,t) has finite mass for a.e. t). 

Moreover, such slicing map satisfies the following additional properties 

• d{T,7r,t) = {-lfi{dT,7r,t); 

• IfT is integer rectifiable, then {T,n,t) is integer rectifiable for a.e. t. 

When T = 0T'H™'LR is integer rectifiable and m = k then we have the explicit formula 

(T, 7 r,t)= ^ e(a;)0(a;) It] (10) 

where e{x) takes the value ±1 depending on whether the restriction of dn\x to the tangent 
plane T^R oriented by T preserves or reverses orientation (e(x) = 0 when such restriction 
is not injective; for a.e. t, none of the e{x) appearing in the sum (10) vanishes). 
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The slicing map generalizes the “intersection” of a smooth submanifold with the coun¬ 
terimages of a smooth map. 

3. First considerations in the regularity theory 

Consider a current T as in the second statement of Corollary 2.7, and assume that the 
set 77 is a smooth complete submanifold S without boundary. Then T must have the 
following minimality property. 

Definition 3.1. Let 12 C be open and S C be a smooth complete submanifold 

without boundary of dimension m + n. We say that an m-dimensional integer rectihable 
current T with hnite mass is area minimizing in S fl 12 (cf. [58, Dehnition 33.1]) if 

• spt(T) C S; 

• M(T -|- dS) > M(T) for every {m + l)-dim. integral S with spt(S') C S fl 12. 

The above dehnition can easily be extended to the case when T has locally hnite mass, by 
requiring ||T -|- (9S'||(17) > ||T||(1/) whenever 17 is a bounded open set contanining spt(S'). 

In the rest of the paper S will always be sufficiently smooth and the number n will be 
called the codimension of T. All our ehorts will now be dedicated to the following question: 
what kind of regularity is implied by the area minimizing condition? In order to make our 
discussion simpler, we dehne precisely the (interior) singular and regular sets of the current 
T: 

Definition 3.2. We say that p G spt(T) \ spt((9T) is an interior regular point if there is 
a positive radius r > 0, a smooth embedded submanifold T of S and a positive integer Q 
such that TlB,,(p) = Q [T]. The set of interior regular points, which of course is relatively 
open in spt(T)\spt((9T), is denoted by Reg(T). Its complement spt(T)\(spt((9T)UReg(T)) 
is the interior singular set of T and will be denoted by Sing(T). 

Observe that, since we will always argue at the local level and the problem is scaling 
invariant, we can assume all sorts of nice properties upon S (for instance that it is a global 
graph of a smooth function with good bounds on its norms). 

We next state the best theorems proved so far concerning the regularity of area min¬ 
imizing currents. The hrst theorem summarizes the achievements of several outstanding 
mathematicians from the end of the sixties till the nineties: De Giorgi, Almgren, Fleming, 
Simons, Federer, Bombieri, Giusti and Simon. It is fair to say that, with the notable excep¬ 
tion of Simon’s rectihability result, the various aspects of the following theorem have been 
well digested in the mathematical communities of elliptic PDFs and geometric analysis. 

Theorem 3.3 (Regularity in codimension 1). Assume that 72, S and T are as in Definition 
3.1 and that n = 1. Then 

(i) For m < 6 Sing(T) n 72 is empty (Fleming & De Giorgi (m=2), Almgren (m=3), 
Simons (4: < m < 6), see [20, 42, 21, 3, 61] and also [50, 65]j; 

(ii) Form = 7 Sing(T) fl 72 consists of isolated points (Federer, see [40]j; 
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(iii) For m >8 Sing(T) fl has Hausdorjf dimension at most m — 7 (Federer, [40] j and 
it is countably {m — 7)-rectifiable (Simon, [59] j; 

(iv) For every m > 7 there are area minimizing integral currents T in the euclidean 
space for which 'H”^“'^(Sing(T)) > 0 (Bombieri-De Giorgi-Giusti, [ll]j. 

As already mentioned, after discussing the features of the codimension 1 case, the rest 
of the note will be devoted to the understanding of the higher codimension, i.e. n > 2. 
For this case the best results are the following. 

Theorem 3.4 (Regularity in codimension n > 2). Assume that S and T are as in 
Definition 3.1 and that n>2. Then 

(i) For m = 1 Sing(T) fl fl is empty; 

(ii) For m > 2 Sing(T) fl fl has Hasudorjf dimension at most m — 2 (Almgren, [5]j; 

(iii) For every m >2 there are area minimizing integral currents T in for which 

'H™'“^(Sing(T)) > 0 (Federer, [38]j. 

Almgren’s result was subsequently sharpened by Chang (cf. [13]) for 2-dimensional area 
minimizing currents. 

Theorem 3.5 (m = 2, h > 2). Assume that fl, S and T are as in Definition 3.1, that 
n >2 and m = 2. Then Sing(T) fl fl consists of isolated points. 

In Section 5 we will discuss extensively the difficulties that any argument for (ii) must 
face. Almgren’s original typewritten proof was more than 1700 pages long and was pub¬ 
lished posthumously thanks to the efforts of his students Scheffer and Taylor in a book 
of almost 1000 pages. In this note we will describe the main steps of Almgren’s program 
following the papers [26, 27, 30, 28, 29] by Emanuele Spadaro and the author. 

Chang’s result builds heavily on Almgren’s book. Moreover Chang’s paper [13] does not 
provide the proof of one major step of the argument, the existence of a “branched center 
manifold”: the construction of such object requires the understanding of 4/5 of Almgren’s 
monograph and a suitable modihcation of its most obscure and involved part, which gives 
the construction of the “non-branched center manifold” (cf. Sections 12, 13 and 14 below). 
Building upon [26, 27, 30, 28, 29], in forthcoming joint papers with Emanuele Spadaro and 
Luca Spolaor we will give the hrst proof of the existence of a “branched center manifold” 
and extend Chang’s theorem to a large class of objects which are almost minimizing in a 
suitable sense, cf. [34, 31, 32, 33]. That proof (and Chang’s theorem) will however not be 
discussed in this survey. 

In the rest of this section we will recall the preliminaries of the regularity theory, most 
notably the monotonicity formula and its consequences. 

3.1. Compactness. A hrst basic fact about area minimizing currents is that, under the 
same assumptions of the compactness theorem of Federer and Fleming, they are also a 
compact class. 

Theorem 3.6 (Compactness of area minimizing currents, cf. [58, Theorem 34.5]). Let 
be a seguence of submanifolds of of dimension m + n which converge in to S 
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and let be a sequence of integer rectifiable area minimizing currents in of dimension 
m with sup^ M(Tfc) < oo. Assume that dT^ = 0 on some open set and that —)■ T. 

Then 

• T is area minimizing in VL AT,; 

• ||Tfe|| ||T|| in the sense of Radon measures. 

3.2. Stationarity and stability. Given a current T supported in a submanifold T, we 
define the first and second variations along a smooth compactly supported vector held 
X tangent to T in the usual fashion (cf. [39, Section 5.1.7]): if we denote by the 
one-parameter family of diffeomorphisms of S generated by X, then 

S'‘T(X)-A M((4,),r), 

ai 

whenever T has hnite mass. If T has only locally hnite mass, we can still dehne its 
variations as 

ll(1>.)|r||(C/), 

or 

where U is any bounded open set which contains spt(X). 5^T is usually denoted by 5T and 
is called hrst variation. 5'^T is called second variation. As usual we introduce stationary 
and stable currents as follows. 


Definition 3.7. If S is a complete smooth submanifold of and T an integer rectihable 
current with spt(T) C S we then say that 

• T is stationary in S if 6T{X) = 0 for every smooth compactly supported vector 
held X tangent to S; 

• T is stable in S if it is stationary and S‘^T{X) > 0 for every smooth compactly 
supported vector held tangent to S. 

Of course area minimizing currents are stable. 

3.3. The monotonicity formula. Testing the hrst variation condition with radial vector 
helds leads to the classical monotonicity formula (cf. [58, Section 17]). 


Theorem 3.8 (Monotonicity formula). Let T be an integer rectifiable m-dimensional cur¬ 
rent supported in a Riemannian manifold T C IfT is stationary in T, then the 

following formula holds for every x ^ spt((9T) and any 0 < a < p < dist(a;, spt((9T)).- 

||T||(B,(a;)) ||T||(B.(a;)) _ f iV^rP 


lBp(x)\Ba{x) G 


d\\T\\ 


^ J^pix) 


{y-x)-H ((max{r, a}) - p ”) d||T|| {y ), 


where 


• r{y) ;= \y - x\; 

• V^fiy) denotes the projection ofVfiy) on the orthogonal complement of the tan¬ 
gent plane to T at y; 
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• H{y) = Cj), where e* is an orthonormal basis of the tangent plane to T at 

y and A^, is the second fundamental form ofYi (namely Aj]{f^,ri) = —where 

denotes the projection of the vector field X on the orthogonal complement of 
the tangent to T,). 

A first obvious consequence of the monotonicity formula is that the density of an area 
minimizing current is in fact dehned at every point; thus from now on we will use 0(T, p) 
(cf. (8)) as a well defined quantity for every p ^ spt((9T). Indeed it is a simple exercise to 
show that 0 is an upper semicontinuous function (and this because, modulo some small 
technicalities, the map p ||T||(Br(p)) is continuous at each hxed r). 

We next describe another crucial consequence of the monotonicity formula. Let T be 
an area minimizing current (in the euclidean space, to simplify our discussion) and p G 
spt(T) \ spt((9T). Recall the homothetic rescalings Tp^r- R is obvious that each Tp^r is an 
area minimizing current in tp,.(S). Observe also that: 

• For each bounded open set hi, we clearly have spt((9Tp^r) fl hi = 0, provided r is 
small enough; 

• For each i? > 0 we have a uniform bound for ||Tp^r|| (Bij(O)): the latter is indeed 
the number r^^HTH (Bijr(p)), which is bounded independently of r thanks to the 
monotonicity formula; 

• tp,r(S) converges, in to the tangent plane to S at p. 

Thus, by Theorem 3.6, for every hxed bounded open hi we can extract a subsequence 
{^p.T-fcjr-fcto which converges in hi to an area minimizing current Tq. Actually, by a standard 
diagonal argument we can hnd a “global” limit current Tq which is an integral current on 
each bounded open subset of which has no boundary and which is area minimizing 

in TpS. Since the latter is a linear subspace of R™+^ it is not difficult to see that Tq is then 
area-minimizing in R™'+". 

Now, it is also obvious that r i-A r^'^HToH(B^) is a constant and equals UmQ(T,p). The 
right hand side of the monotonicity formula implies that this is possible if and only if Tq is a 
cone with vertex 0, namely (to,r)i;To = Tq. The standard terminology for Tq is tangent cone 
to T at p: note however that its uniqueness (i.e. the convergence of every sequence Tp^rk to 
the same cone) is not guaranteed and in fact it is one of the major unsolved problems in 
the field. The uniqueness has been proved for 2-dimensional currents T in any codimension 
by White in his remarkable paper [67] and it has been shown by Simon in codimension 
1 at any isolated singularity in the fundamental work [57]. The latter result is indeed a 
consequence of a remarkably general approach, which applies to other variational problems 
(such as the uniqueness of tangent maps to energy minimizing maps) but also to the study 
of the asymptotic behaviour of solutions to parabolic equations, see [57]. 

This motivates the following 

Definition 3.9 (Tangent cones). An area minimizing cone of dimension m in R™'+’^ is 
an area-miminimizing integer rectihable current S of dimension m such that dS = 0 and 
So,r = S for every positive r. Next, if T and S are two currents such that, for some 
p G spt(T) and some r^ f 0, Tp^ru converges to S, we then say that S is tangent to T at p. 




14 


CAMILLO DE LELLIS 


We summarize in the following theorem the most important consequences of the mono¬ 
tonicity formula: 


Theorem 3.10 (Tangent cones, cf. [58, Section 7.3]). LetT be an area minimizing integral 
eurrent of dimension m in a submanifold S of dimension m + n in Then 

(i) r i-A e'"^^r“™||T|| (Br(p)) is a monotone function for each p ^ spt((9T) on the 
interval ]0, dist(p, spt((9T)[, where C is a suitable dimensional constant and A is 
the norm of the second fundamental form of S; 

(ii) The density Q(T,p) is well defined at every p ^ spt{dT), it is at least 1 at each 
point p G spt(T) \ spt((9T) and it is upper semicontinuous; 

(iii) For every p ^ spt((9T) and every seguence rk i 0 there is a subseguence, not rela¬ 
beled, and an area minimizing cone Tq such that Tp ,.,. —t Tq; Tq ^ if and only if 
p e spt(T); 

(iv) If Tk and are as in Theorem 3.6, then spt(Tfc) converges, locally in hi in the 
sense of Hausdorjf, to spt(T). 

One technical detail that plays a crucial role in the proof of Theorem 3.4 is the following 
“improvement” of Theorem 3.10(i) 

Theorem 3.11 (Cf. [30, Lemma A.lj). There is a constant C{m,n,n) with the following 
properties. Let T, S and A be as in Theorem 3.10. Then 


r I—)■ e 




l|r||(B4p)) 


is monotone 


on the interval ]0, dist(p, spt((9T))[ for every p ^ spt((9T). 


4. The regularity theory in codimension 1 

The hrst breakthrough in the regularity theory is due to De Giorgi: he realized in his 
fundamental work [20] that in codimension 1 the existence of one flat tangent plane at 
p is enough to conclude that p is a regular point. His theorem was then extended to 
any codimension by Almgren in [2] (see also [55]) under an important assumption on the 
density which we will discuss extensively in a moment (indeed the latter theorem can be 
suitably generalized to Hilbert spaces, cf. [7]). In fact Almgren’s statement covers many 
more geometric functionals, which satisfy an appropriate ellipticity assumption. In the 
framework of minimal (i.e. only stationary) surfaces the most important generalization of 
De Giorgi’s £-regularity theorem is due to Allard in [1] (cf. also [58, Chapter 4] and [23]): 
his theorem, valid for a far reaching generalization of classical stationary surfaces (namely 
integer rectihable varifolds with sufficiently summable generalized mean curavature) is the 
starting point of a variety of applications of the minimal surface theory to geometric and 
topological problems. 

We will first recall the De Giorgi-Almgren e-regularity theorem in all dimensions and 
codimensions, after introducing the key parameter of “flatness” 
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Definition 4.1 (Spherical excess). Let T be an integer rectifiable m-dimensional current 
and TT be an m-dimensional plane, oriented by the unit simple m-vector tt. The (spherical) 
excess of T in the ball Bp(p) with respect to tt is the quantity 

_... 1 r ,. ___ 


E(T,B,(p),7r) := 


|T(a;)-7f|^d||T||(a;). 


The excess in B,(p) is 

E(T, Bp(p)) ;= min{E(T, Bp(p), tt) : tt is an oriented m-plane} . (13) 

If TT achieves the minimum in the right hand side of (13) we then say that tt optimizes the 


Since we will often deal with m-dimensional balls in m-dimensional planes tt, we intro¬ 
duce here the notation Br{p, n) for the set Br(p) fl (p -|- tt). 

Theorem 4.2 (5- regularity). Let T be an m-dimensional integer rectifiable area minimizing 
current in a submanifold S of dimension m + fi . There are constants a > 0, e > 0 
and C, depending only upon m and n, such that the following holds. Assume that for some 
p > 0 and some m-dimensional plane tt we have 

(a) (9 TlB2p(p) = 0; 

(b) Q(T,p) = Q and Q = Q ||T||-a.e. on B 2 p(p), for some positive integer Q; 

(c) ||T||(B 2 p(p))<(gn;™ + £)(2p)-; 

(d) E ;= E(T,B2p(p),7r) < e and pA ;= pmaxsnB 2 p(p) l^s| < £• 

Then TLBp(p) = Q [T] for a surface T which is the graph of a suitable function 
u : Br{p,TT) — )■ TT"*-. Morcovcr [Du]o^a < -L pA)p““. 

From now on we will often encounter oriented graphs of (at least Lipschitz) functions 
u and the corresponding integer rectifiable currents: the latter will be denoted by G„, 
whereas we will use Gr(n) for the set-theoretic graph. 

4.1. De Giorgi’s idea. The essential point in the proof of Theorem 4.2 is that, under 
the above assumptions, the current T is close to the graph of an harmonic function v 
and hence the excess in a ball is close to the mean oscillation of the gradient of Dv\ 
since the latter quantity has nice decay properties for harmonic functions, it is possible to 
show that the spherical excess of T decays suitably. We illustrate roughly this well-known 
idea because we want to stress that the Holder exponent a in Theorem 4.2 can be made 
arbitrarily close to 1. Presently this is a very minor observation because any graph 
which is stationary for the area functional enjoys higher regularity by the classical Schauder 
estimates. However we will see later that the remark is crucial in the proof of Theorem 
3.4. 

More precisely, assume a-priori that T is the graph G^ of a Lipschitz function n : p-|-7r ^ 
TT"*-. For every hi C tt we can compute the mass of T in the cylinder C := H x tt"*- using the 
area formula: 

M(TlC) = [ 

Jn 
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where with M^{Du) we denote an arbitrary k x k minor of Du. 

The smallness of the excess E(T, B 2 p(p), tt) indicates that most of the tangents to Gu 
are close to the horizontal plane vr: if this control were uniform we could make a Taylor 
expansion of the integrand to achieve 

M(TlC) = |ff|+ ^ [ \Du\^ + 0{\Du\^). 

^ Jn 

Thus we can assume that u is rather close to a minimizer of the Dirichlet energy, i.e. that 
it is close to an harmonic function v. 

Recall moreover M(TLC) — |r2| = ^ JqIT — 7fp(i||T||. Similarly, we can compare 
E(T, B 2 p, tt) to the average integral 


\Du\^ 

B2p(pD 

and analogous computations show that E(T, Bp(p)) is comparable to a similar “optimized’" 
quantity 


a 


mm , 

^ Jb 


\Du-A\^ = 


B pip, it) 


B pip,- it) 


Du 


Du 


B pip,- it) 


For harmonic functions v we have the following decay estimate, which could be proved using 
the expansion of the trace v\dBpip,-K) in spherical harmonics (see [64, Chapter 5, Section 2]) 


B pip,- it) 


Dv 


Dv 


B pip,- it) 


< - 
- 4 


\Dv\\ 


B2pip,-rr) 


We could then hope to transfer such decay to the current in the form 

E(T,B,(p)) < 2-2+2^E(T,B2p(p),7r), 


(14) 


(15) 


where the constant 5 > 0 takes into account (quite a few) error terms, which must be 
carefully estimated. 

Note however that we could optimize on the plane in the right hand of (15) to achieve 

E(T, B,(p)) < 2-2+25 e(T, B2,(p)) . (16) 

In turn this latter estimate would imply that the assumption (d) of Theorem 4.2 holds 
also in the ball Bp(p). Since all other assumptions are automatically satisfied at any scale 
smaller than 2p (the monotonicity formula plays a crucial role here), we could then iterate 
the argument to obtain the decay 

E{T,Br{p)) < . (17) 


However, if we choose e sufficiently small, this process can be applied replacing Br(p) with 
any ball Br{q) such that q G Bp/ 2 (p) and r < p. We have thus achieved a “Morrey-type” 
estimate which easily shows that spt(T) nBr /2 is contained in the graph of a function 
u, where a = 1 — S. The Constanty theorem (cf. [58, Theorem 26.27]) implies that T must 
be (an integer multiple of) G„ in B,./ 2 (p). 
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Remark 4.3. As already mentioned, the argument sketched above leads to the conclusion 
that a in Theorem 4.2 can be taken arbitrarily close to 1, at the price of making the 
threshold e suitably small and the constant C fairly large. For a detailed proof we refer the 
reader to [25, Corollary 2.4 and Appendix A]. The subsequent generalizations of Almgren 
[2, 4], Allard [1] and other authors (cf. for instance [10] and [55]) of De Giorgi’s e-regularity 
statement do not imply the same decay of the excess by simply halving the radius, but seems 
to require a more subtle adjustment of the parameters: a (small) sacrihce to flexibility, since 
the latter results can be applied to much more general objects and situations. 

4.2. First consequences of the e-regularity theorem. It is rather simple to see that 
the conditions (a), (c) and (d) will be met at a sufficiently small radius p as soon as 
p G spt(T) \spt((9T) and there is at least one flat tangent cone at p. However condition (b) 
discriminates severely between the codimension 1 case (h = 1) and the higher codimensions. 
Indeed, Propostion 2.15 shows that in codimension 1 a current without boundary can be 
described as a “superposition” of boundaries of hnitely many Caccioppoli sets Ei. In 
particular, we can reduce the regularity question for area-minimizing currents T under the 
additional assumption that 

T = d |i?] for some measurable E C Tj with < oo. (18) 

Under the additional assumption (18) the existence of one flat tangent cone at a point 
p guarantees that 0(T,p) = 1, which in turn implies condition Theorem 4.2(b) for a 
sufficiently small p, because of the upper semicontinuity of the density. We then achieve 
the following corollary (cf. [58, Section 37]). 

Corollary 4.4. If T is an area minimizing eurrent of dimension m in a submanifold 
S of dimension m + 1, then any point p at whieh there is a fiat tangent cone is a regular 
point. 

Indeed the “sheeted structure” given to codimension 1 integral currents by Proposition 
2.15 has a much stronger consequence, which we record in the following 

Proposition 4.5. Let T, S, and be as in Theorem 3.6 and assume that the codi¬ 
mension n is 1. IfT is regular in hi, then for any open set F CC is regular in F for 

k large enough. 

For the sake of our future discussions we will rephrase the proposition above in the 
following equivalent way, underlying that “singularities persist in the limit”: we will stress 
later on that this persistence can be seen as the major difference between the codimension 
1 and the higher codimension. 

Proposition 4.6 (Persistence of singularities in codimension 1). Let T, S, Tk and 
Sfc, be as in Theorem 3.6 and assume that the codimension n is 1. If pk E Sing(Tfc) and 
Pk ^ p E li, then p G Sing(T). 

In higher codimension there is no analog of Proposition 2.15 which allows to bypass 
Assumption (b) in Theorem 4.2 and we will see that indeed Corollary 4.4 fails, together 
with the stronger propositions stated above. 
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The upper semicontinuity of the density and an elementary argument implies only that 
Assumption (b) of Theorem 4.2 holds in a dense set, leading to the well-known 

Corollary 4.7 (Density of Reg(T), cf. [58, Theorem 36.2]). If T is an area minimizing 
current of dimension m in a submanifold S of dimension larger than m+1, then Reg(T) 
is dense in spt(T) \ spt((9T). 

Indeed this statement has been recently extended to any Hilbert space, cf. [7]. Re¬ 
markably, Corollary 4.7 was the best regularity result available before the appearance of 
Almgren’s manuscript [5] with its proof of Theorem 3.4. In fact this is the current situa¬ 
tion for stationary integer rectifiable m-dimensional varifolds: Allard’s theorem gives the 
regularity up to a meager closed set, even in codimension 1, and this is up to now the best 
regularity result available in the literature for stationary objects. In particular it is not 
known that the singular set is "H^-negligible, not even in the simplest setting of stationary 
2-dimensional varifolds in 3 dimensions. For stable hypersurfaces a rather satisfactory the¬ 
ory is instead available thanks to the pioneering works of Schoen - Simon - Yau [56] and 
Schoen - Simon [54] and to the recent ones of Wickramasekera [70]. 

4.3. Pull regularity for m < 6 and h = 1: Simons’ theorem. Let us now focus on the 

case of codimension h = 1. Corollary 4.4 naturally leads to discuss the existence of area 
minimizing (hyper-)cones which are not flat. The investigations upon this question were 
started by De Giorgi and Fleming who could show full regularity for m = 2, cf. [42] and 
[21]. Moreover, De Giorgi showed that the problem of deciding whether every codimension 
1 area minimizing cone in is flat is equivalent to decide whether any entire minimal 

(hyper-) graph in is affine, the so-called Bernstein problem. The result of De Giorgi 

and Fleming was subsequently improved by Almgren ([3], m = 3) and hnally by Simons 
in [61] to show full regularity for m < 6. 

Observe that if we know full regularity for area-minimizing (hyper)currents of dimension 
m < mo then there is no singular area minimizing hypercone in R™'°+^. As a consequence 
any area minimizing hypercone in R™o+2 is necessarily regular except possibly at the origin; 
thus the cross-section is a minimal (i.e. stationary) embedded hypersurface of the standard 
(mo-l-l)-dimensional sphere. In fact the cone 0)^ |F] over a closed submanifold F of (9Bi(0) 
is stationary in Bi(0) if and only if F is stationary in (9Bi(0). However the minimizing 
property is a stronger information: in particular 0;^ |F] is necessarily stable. The famous 
theorem of Simons is the following statement. 

Theorem 4.8 (Simons). Let 2 < m < 6. Any stable minimal hypersurface o/R™'"*'^ which 
is a cone over a minimal submanifold o/(9Bi(0) is necessarily an m-dimensional plane. 

Theorem 4.8 and Corollary 4.4 settle the regularity of area minimizing currents of di¬ 
mension m < 6 in codimension 1. 

4.4. Simons’ cone and the Theorem of Bombieri-De Giorgi-Giusti. In his cele¬ 
brated paper [61] Simons provided also an example which showed the optimality of his 
theorem. More precisely he showed that the cone over x C namely 

S ■.= {x & x\ -\- x\ -\- xl x\ = xl -\- xl xl} 


(19) 
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is stationary and stable. The surface is usually called Simons’ cone in the literature which 
followed [61]. Later Bombieri, De Giorgi and Giusti in [11] showed that S is indeed an area 
minimizing cone and were thus able to settle the Bernstein problem in all dimensions. 

Theorem 4.9 (Bombieri-De Giorgi-Giusti). S in (19) is an area minimizing eurrent in 
and therefore for any n > 8 there are functions m : M" —)■ R which satisfy the minimal 
surface equation and are not affine. 

For an elegant and simple proof we refer the reader to [35]. 


4.5. Federer’s reduction argument. In this section we review the basic idea behind 
Federer’s partial regularity theorem for m > 7, namely Theorem 3.3(ii)&(iii). The crucial 
(elementary!) observation is that, if S is an area-mininizing cone (with vertex at the 
origin) in R™'+'^ and Sq is a tangent cone to S' at a point p ^ 0, then S'o is invariant under 
translations in direction p. It is then not difficult to conclude that S'o “splits off a line”. 


Lemma 4.10 (Splitting off lines, cf. [58, Lemma 35.5 and proof of Theorem 35.3]). If S'o 
is a tangent cone to an area minimizing m-dimensional cone S in R™'+'^ at a point p ^ 0, 
then S'o = |R] x Z for some {m — 1)-dimensional area minimizing cone Z in 


Thus, if we £x an area-minimizing cone S' of dimension 7 in R® we conclude from Theorem 
4.8 that any cone tangent to S'o at a point which is not the origin must necessarily be flat 
and Corollary 4.4 implies that S' is smooth except, possibly, at the origin. Let now T 
be an area-minimizing current of dimension 7 in a Riemannian submanifold S C of 
dimension 8 and fix a point p G Sing(T). If p were not an isolated singularity, there would 
be a sequence of singular points {pk} converging to p and, upon setting := \pk — p\ we 
could consider the rescaled currents ■ These currents would have a singularity at 0 and 
a singularity at Up to subsequences we could assume that qk converges to some 

q with |g| = 1 and that converges to some tangent cone S. S is an area-minimizing 
cone in TpS, which is an 8-dimensional euclidean space. The above discussion gives that S 
is regular outside the origin, but Proposition 4.6 would imply that the point g is a singular 
point for S. 

A repeated application of Lemma 4.10 allows to infer a similar conclusion for any m > 8. 
In particular, assume that for some m > 8 there is an area-minimizing current of dimension 
m in a Riemannian manifold S of dimension m-|-l such that 'H™“^^"(Sing(T)) > 0. By the 
classical results on densities (cf. [58, Theorem 3.2]) there is a point p G spt(T) \ spt((9T) 
such that 


lim sup 

r4,0 


77™-^+'"(Sing(T) n B,(p)) 

pm—7+a 


> 2 


{m—7+a) 


We can thus assume the existence of an area minimizing cone S of dimension m in TpS 
and of a subsequence of rescalings converging to it for which 


hmsup77”^-^+“(Sing(Tp,^J n Bi) > . (20) 

k^oo 


In addition, if F is the limit, in the Hausdorff sense, of SingiTp^rt) G Bi, we conclude from 
Proposition 4.6 that P C Sing(S'). It is well known that the Hausdorff measures are not 
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upper semicontinuous under Hausdorff convergence and thus we cannot use (20) to conclude 
'H”^“'^+"(Sing(S')) > 0. On the other hand the measures are upper semicontinuous, 
(cf. [58, Appendix A]). Hence if we choose a point p such that 


lim sup 

r4,0 


?^^-^+"(Sing(T)nB,(p)) 

7+a 


^ c2 — (m—7+a) 


(which is possible, cf. [58, Theorem 3.6]) and argue as above with p in place of p, we then 
hnd an area-minimizing cone S with the property that 'H™“^’''“(Sing(S')) > 0. We can 
then repeat the procedure to hnd a second cone Sq, tangent to S at some g 7 ^ 0, with the 
property that 'H™“’^’''"(Sing(S'o)) > 0. Since So has the product structure given by Lemma 
4.10, we infer the existence of an area-minimizing current Si of dimension m — 1 in 
such that 'H^~^^~^^“(Sing(S'i)) > 0. Repeating this argument {m — 7) times we arrive at 
an area minimizing current Sm -7 of dimension 7 in which has 'H[^(Sing(S'm- 7 )) > 0, 
contradicting our previous conclusion that Sing(S'm- 7 ) is a discrete set. 


4.6. Simon’s rectifiability result. We complete our survey of the regularity results in 
codimension 1 by mentioning Simon’s spectacular achievement: combining his fundamen¬ 
tal theorem about the uniqueness of tangent cones at isolated singularities with several 
additional innovative ideas, he has been able in the nineties to show that, when n = 1 , 
Sing(T) can be covered, up to a set of "H^^^-measure zero, by a countable collection of 
{m — 7)-dimensional submanifolds, cf. [59]. A new proof of Simon’s theorem, which 
avoids the discussion of the uniqueness of tangent cones at isolated singularities, has been 
very recently found by Naber and Valtorta, see [49]. This is till now the best description 
available for the behavior of the singular set in codimension 1 . 


5. FEDERER’S theorem and the failure of e-REGULARITY IN CODIMENSION n>2 

5.1. Holomorphic subvarieties as area minimizing currents. We start by recalling 
that holomorphic subvarieties of namely zeros of holomorphic maps u : —)■ O 

{k and j being, respectively, the complex dimension and codimension of the variety) can 
be given a natural orientation. In what follows we identify with in the 

usual way: if zi,...,Zk+j are complex coordinates and Xj = Rezj, pj = Imzj, we let 
Xi,yi,... ^Xk+jiVk+j be the standard coordinates of R^^+^L Recall then that an holo¬ 
morphic subvariety T of of complex dimension /c is a (real analytic) submanifold of 
R 2 fc+ 2 i y Sing(r) of (real) dimension m = 2k, where Sing(r) is an holomorphic subvariety 
of complex dimension k — 1. 

Furthermore, at each point p G F \ Sing(r), the (real) tangent 2A;-dim. plane TpF can 
be identihed with a complex /c-dimensional plane of C". If ui,..., Ufc is a complex basis of 
TpF, we can then dehne a canonical orientation for TpF using the simple 2A;-vector 

Re Ui A Im Ui A ... A Re Ufc A Im Ufc . 

This allows us to dehne the current [F] by integrating forms over the oriented submanifold 
F \ Sing(r). It is also easy to check that i9|r] = 0, the reason being that the “singular 
set” Sing(r) is a set of (locally) hnite 


measure. 
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The discussion can be “localized” to holomorphic subvarieties in open subsets of 
(and more generally in complex hermitian manifolds). Note also that, if is a bounded 

open subset of the domain where T is dehned, then |r] has hnite mass in and it is thus 

an integer rectihable current. The following fundamental observation is due to Federer and 
is based on a classical computation of Wirtinger ([71]). 

Theorem 5.1 (Federer, cf. [39, Section 5.4.19]). Let Fi, ... F^v be holomorphic subvarities 
of complex dimension k in Q C and let ki,... be positive integers. Then the 

current T := /ci |Fi] + ... + [Fty] is area minimizing in 12. 

Indeed the above theorem holds in general Kahler manifolds, cf. [39, 5.4.19]. 


5.2. Branching phenomena. Before giving an idea of why Theorem 5.1 holds we want 
to illustrate the deep consequences that it has in the regularity theory for area minimizing 
currents in codimension higher than 1. Holomorphic subvarieties give easy counterexamples 
to Corollary 4.4 when h > 1: assumption (b) in Theorem 4.2 is absolutely crucial in this 
case. As a byproduct even Proposition 4.6 fails and singularities might disappear in the 
limit when we deal with sequences of area minimizing currents in codimension higher than 
1: in the rest of this note we will see that the core difficulty in the proof of Theorem 3.4 is 
precisely this phenomenon of “disappearance of singularities”. We illustrate these points 
with three explicit examples. 


Example 5.2. Let 5 > 0 be a small number and consider the holomorphic curve 

F 5 := {{z,w) E : z^ = Swj 


and the plane 


TT := {(z,w) E C'^ : z = 0} . (21) 

There is no neighborhood of 0 where F 5 is the graph of a function = f{w), in spite of the 
fact that EdF^] ,Bi( 0 ), 7 r) converges to 0 as <5 0. In fact the conclusion of Theorem 4.2 

does not apply: although each F^ is smooth and it is graphical in Bp(0) for any p, there is 
no uniform control of the norm of the graph in terms of the excess. Observe that the 
F 5 do not satisfy the condition (c) in Theorem 4.2, although they satisfy (a), (b) and (d). 


Example 5.3. Consider the holomorphic curve 

F := {{z, w) eC^ ■. z^ = w^} . 

The origin belongs to Sing(|F]). On the other hand: 

• The unique tangent cone at 0 is given by 2 [tt] for tt as in (21). 

• The density of |F] equals 2 at 0; 


limE(|F],BdO), 7 r) = 0. 

r4,0 

Therefore: 

• Corollary 4.4 is false for 2-dimensional area minimizing currents in R^: F is singular 
at the origin in spite of the existence of a flat tangent cone there. 
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• Again Theorem 4.2 does not apply in any ball B2p(0). Note however that the only 
missing assumption is (b): the density 0(|r] ,p) equals 1 at every point p G r\{0} 
and equals 2 at p = 0. 

• Proposition 4.6 fails for 2-dimensional area minimizing currents in Indeed 0 is a 
singular point for iPJo^ for every positive r > 0. On the other hand iPJo^ 2 [tt] 
and thus 0 is not a singular point of the limit; the singularity “has disappeared”. 

Example 5.4. Consider dually the holomorphic curve 

S := {{z, w) eC'^ : {z - . 

All the considerations valid for the holomorphic curve P of Example 5.3 are also valid for H. 
S does not add much for the moment to our discussion, but it will play a crucial role later: 
observe that 0 is a singular point in spite of the fact that S is an almost imperceptible 
perturbation of the smooth current 2 {{z = 

We close this section by remarking that Theorem 5.1 gives also a great abundance of 
singular area minimizing cones in higher codimension: the zero set of any homogeneous 
polynomial P{zi,... ,Zk+i) in k + 1 complex variables is an area minimizing cone of di¬ 
mension 2k in More generaly, for any projective subvariety of P^C with complex 

dimension j we can construct a corresponding area-minimizing cone in of dimension 

2j + 2. These cones are singular except when the corresponding algebraic subvarieties are 
affine. The easiest example of a singular area minimizing cone is thus the union of an ar¬ 
bitrary number of complex lines in C^. Such cones might however be considered “mildly” 
singular: in the generic cone associated to a projective curve of P^C has a singular set 
which behaves in rather complicated way. 

5.3. Calibrations and the proof of Theorem 5.1. We illustrate here the simple, yet 
deep, principle lying behind Theorem 5.1. Recall first the notion of comass of a form, given 
in Definition 2.3. 

Definition 5.5 (Calibrations, cf. [47]). A calibration ca is a closed m-form such that 
11 a; lie < 1. An integer rectifiable current T is said to be calibrated by a calibration uj if 
{ujp,f{p)) = 1 for ||T||-a.e. p. 

Observe in particular that M(T) > T{uj) whenever ca is a calibration and that the 
equality sign holds if and only if T is calibrated by u. The following is then a trivial fact 

Lemma 5.6. IfT is calibrated by a calibration u, then T is an area minimizing current. 

Proof. Let S be an [m -|- l)-dimensional integral current. Then 

M(T) =T{u) = T{u) + S{du) = {T + dS){u) < M(T + dS). □ 

Holomorphic subvarieties are the primary example of calibrated currents and this obser¬ 
vation dates back essentially to Wirtinger. More precisely, if Z( = X£ + iy^ are the standard 
coordinates in consider the Kahler form 

u := dxi A dyi dxk+j A dyt+j ■ 

Wirtinger’s theorem can then be stated in the following form 
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A V 2 k\ if and only if 

Calibrations and calibrated submanifolds are a rich source of interesting geometries: we 
refer the reader to [47] for several important examples. 

6. AlMGREN’S STRATIFICATION 

From now on we will focus on area minimizing currents in codimension n > 1 and 
we proceed in describing the proof of Theorem 3.4. The hrst step in the analysis is the 
elegant generalization of Federer’s reduction argument known as Almgren’s stratification 
and widely used to analyze singularities in geometric analysis. 

Given an area minimizing m-dimensional cone S we dehne its spine as the vector space 

V of maximal dimension for which S can be written as S' x |C], where S' is an area 
minimizing cone of dimension m — dim (V). Equivalently, V is the subset of those vectors 

V such that S is invariant under translations in direction v and it is a simple exercise 
(using the monotonicity formula) to show that V can be characterized as the subset of 
those points p G spt(5') such that Q{S,p) = Q{S, 0) or, equivalently, as the subset of those 
points q such that Sg^i is also an area minimizing cone (cf. [58, Proof of Lemma 35.5]). 

At the intuitive level it is clear that S must have a certain “asymmetry” in the directions 
which are transversal to V. The dimension of the spine of S is called the building dimension 
of the cone S (cf. [69]). Note that such building dimension equals m if and only if S is an 
integer multiple of an m-dimensional plane, namely if and only if S is flat. 

Definition 6.1. A point p G spt(T) belongs to the stratum Sk{T) if every tangent cone 
to T at p has building dimension at most k and if there is at least one tangent cone to T 
at p with building dimension k. 

Almgren’s generalization of Federer’s argument can then be stated in the following the¬ 
orem. 

Theorem 6.2 (Almgren’s stratihcation, cf. [69]). For any given area minimizing current 
T the stratum Sk(T) has Hausdorjf dimension at most k and So(T) is a discrete set. 

Observe that the discussion of Section 4.5 proves that: 

(F) Given any area minimizing cone S of codimension n = 1, either such cone is a 
multiple of an m-dimensional plane, or its building dimension is at most m — 7. 

As a corollary we conclude that for n = 1 the strata 

Sm-l(T),Sm-2(T), ■■■ ,Sm-6(T) 

are all empty. Next, at any point p in the top stratum Sm(T) there is a flat tangent cone 
and thus, by Gorollary 4.4, we actually know that Sm(T) = Reg(T) (we stress again that 


Theorem 5.7 (Wirtinger, cf. [71]). If u is the Kabler form and 


u = — gj A . A u 
k times 

then is a calibration. Moreover, {u^,Vi A ... A V 2 k) = |ni A ... 
Ui,..., V 2 k is a positively oriented (R-)base of a complex plane. 
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this holds only under the assumption that n = 1: Example 5.3 shows its failure as soon as 
n = 2 and m = 2). We therefore conclude that Sing(T) = Sq{T) U ... U Sm-iiT) and thus 
Theorem 3.3(ii)&(iii) is a corollary of Theorem 6.2. 

Unfortunately from Section 5 we know that the identity Reg(T) = SmiT) does not hold 
anymore when the codimension n > 1. On the other hand we surely have Reg(T) C Sm{T). 
We could call “branch points” for T those points p G Sm{T) \ Reg(T). The major concern 
in the rest of the note will be to estimate the Hausdorff dimension of Sm{T) \ Reg(T). A 
simple consequence of Theorem 6.2 is that, in order to prove Theorem 3.4(ii), the “only” 
concern is truly to bound the Hausdorff dimension of the set of branch points by m — 2, 
because of the following lemma. 

Lemma 6.3 (cf. [58, Theorem 35.3]). The stratum Sm-i(T) is empty in any codimension 
n. 


The proof is very elementary: given the dehnition of the strata we just need to show that 
the building dimension of an area-minimizing m-dimensional cone can never be m — 1; in 
turn this is equivalent to show that 1-dimensional area-minimizing cones are never singular, 
namely they are always single straight lines counted with integer multiplicity. The latter 
statement has a rather trivial proof. 

The proof of Theorem 6.2 is relatively elementary and “soft”. In spite of this the idea is 
powerful and can be applied to several different problems in geometric analysis; for instance, 
we refer the reader to Simon’s work on the singularities of harmonic maps, [60], to White’s 
far-reaching generalization of Theorem 6.2 and its applications to the mean-curvature flow, 
[69], and to recent results about Riemannian manifolds with one-sided curvature bounds, 
see for instance [15]. Recently, in a series of works (cf. [16, 14, 17]), the method of Almgren 
has been extended to deal with the Minkowski content, see also [43] for an abstract general 
version of this. 

We hnally mention that the cones with building dimension m — 2 can be actually further 
characterized: it is not difficult to see that such cones are necessarily unions of multiples of 
m-dimensional planes. The spines of such cones are (m — 2)-dimensional subspaces. Due 
to the remarkable work of White, [67], when m = 2 there is one such unique tangent cone 
at every point p G Sq{T). However, for m > 3 the same uniqueness result is not yet proved 
and in fact it is not even known whether at points p G Sm{T) \ Reg(T) the flat tangent 
cone is the unique one! 

7. Multiple valued functions and the Dirichlet energy 

As already noticed, in codimension 1 the regularity in a neighborhood of a point with 
integer multiplicity Q where at least one tangent cone is flat can be reduced to the case 
of multiplicity Q = 1, whereas the discussions of Section 5 show that this reduction is 
impossible in codimension larger than 1. Indeed, in the Examples 5.3 and 5.4 even the 
starting point of De Giorgi’s strategy as described in Section 4.1 fails dramatically: no 
matter how small is the neighborhood U of the origin that we choose, it is simply not 
possible to approximate efficiently the corresponding current T in U with the graph of a 



THE SINGULAR SET OF AREA-MINIMIZING GURRENTS 


25 


(single valued) function. However, in each of these examples the current turns out to be a 
“multivalued” graph, where the number of values is in fact determined by the multiplicity 
Q = 0(T, 0). This discussion motivates the starting idea of Almgren’s monograph: in 
order to go beyond an Allard’s type statement (namely regularity in a dense relatively 
open subset of spt(T) \ spt((9T)) we need to develop an efficient theory for “multiple 
valued functions” minimizing a suitable generalization of the Dirichlet energy, where we 
can (and we will) consider the multiplicity to be a constant preassigned positive integer Q. 

7.1. The metric space of unordered Q-tuples. The obvious model case to keep in 
mind is the following. Given two integers k, Q with MCD(/c, Q) = 1, look at the set valued 
map which assigns to each point z G C the set M{z) := {w^ : = z} (Z C. Obviously for 

each 2 : we can choose some arbitrary ordering {ui^z).,. .. ,uq{z)} of the elements of the set 
M{z). However, it is not possible to do it in such a way that the resulting “selection maps” 
I—)■ u^{z) are continuous: even at the local level, this is impossible in every neighborhood 
of the origin. 

Our example motivates the following dehnition. Given an integer Q we dehne a Q-valued 
map from a set E G into R” as a function which to each point x E E associates an 
unordered Q-tuple of vectors in R”. Following Almgren, we consider the group of 
permutations of Q elements and we let ^q(R"') be the set (R"')'^ modulo the equivalence 
relation 

(ui, ...,vq) = (n^(i),..., ^^(Q)) W e 

Hence a multiple valued map is simply a map taking values in ^q(R”). There is a fairly 
efficient formulation of this dehnition which will play a pivotal role in our discussion, 
because the set ^q(R’^) can be naturally identihed with a subset of the set of measures 
(cf. [5] and [26, Dehnition 0.1]). 

Definition 7.1 (Unordered Q-tuples). Denote by |Pj] the Dirac mass in G R”. Then, 

^q(R”) for every f = 1 ,..., q| . 

Observe that with this dehnition each element of .4 .q(R"') is in fact a 0-dimensional 
integral current. This set has also a natural metric structure; cf. [5] and [26, Dehnition 
0.2] (the experts will recognize the well-known Wasserstein 2-distance, cf. [66]). 

Definition 7 . 2 . For every Ti,T 2 G .4,q(R"'), with Ti = T 2 = we set 

giT ,, T ,) := min (22) 

Remark 7 . 3 . Since we will often need to compute Q(T,Q |0]) we introduce the special 
notation |T| for the latter quantity. Observe, however, that .4 ,q(R"') is not a linear space 
except for the special case Q = 1: the map T —)■ |T| is not a norm. 
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7.2. Q- valued maps. Using the metric structure on .4 .q(M”) one defines obviously mea¬ 

surable, Lipschitz and Holder maps from subsets of into .4 .q(M”). One important point 
to be made is about the existence of “selections”. A selection for a Q-valued function u is 
given by Q classical single valued functions such that u{x) = |Mj(a;)], cf. 

[26, Definition 1.1]. If the Ui are measurable, continuous, Lipschitz, etc. the selection will 
be called measurable, continuous, Lipschitz, etc. It is rather easy to show that a measur¬ 
able selection exists for any measurable u, cf. [26, Proposition 0.4]. Incidentally, this will 
be used repeatedly as we write 

Ew 

i 

for any given measurable Q-valued map u, tacitly assuming to have chosen some measurable 
selection. 

However continuous maps (resp. Sobolev, Lipschitz) do not possess in general selections 
which are continuous (resp. Sobolev, Lipschitz): the primary examples are the maps 
stemming from holomorphic subvarieties already discussed at length. If, however, they do, 
the corresponding selection will be called regular. Only maps defined on 1-dimensional 
intervals are a notable exception, since they always have regular selections. 

7.3. The generalized Dirichlet energy: geometric definitions. If we want to ap¬ 
proximate area minimizing currents with multiple valued functions and “linearize” the 
area functional in the spirit of De Giorgi, we need to define a suitable concept of Dirich¬ 
let energy. We will now show how this can be done naturally, proposing three different 
approaches. 

Consider the model case oi Q = 2 and assume u : Q ^ .4.2(M*^) is a Lipschitz map. If, 
at some point x, u{x) = |Pi] -|- IP2] is “genuinely 2-valued”, i.e. Pi 7^ P2, then there exist 
obviously a ball Br{x) C D and a regular (Lipschitz) selection, namely Lipschitz classical 
maps Ml,M2 : Br{x) —)■ such that u{y) = |mi(|/)] -|- |m 2 (i/)] for every y G Br{x). On 
the other hand on the closed set where the values of u are “collapsed” we can find a single 
Lipschitz map v such that m = 2 |m] . It is easy to generalize this to the Q-valued case and 
to maps defined on a manifold S: 

Lemma 7.4 (Decomposition, cf. [27, Lemma 1.1]). Let M d T, be measurable and F : 
M —)■ 4.q(M”) Lipsehitz. Then there are a eountable partition of M in bounded measurable 
subsets Mi (i eN) and Lipsehitz funetions // : Mi ^'EL (j G {1,..., Q}) sueh that 

(a) P|Mi = E?=i Ifi] for every ie'N and Lip(//) < Lip(P) Vi, j; 

(b) Vi G N and j,j' G {1,..., Q}, either // = // or fi(x) 7^ // (x) Vx E Mi. 

The Dirichlet energy can be defined for Lipschitz maps P as above by 

Dir(F,E)-E/ WL- ( 23 ) 

ij Xu 

lU^’^ maps and their Dirichlet energy can then be defined by relaxation: assuming that 
D is open IU^’^(D, VIq(M"')) consists of those measurable maps m : D —)■ VIq(M'^) for which 
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there is a sequence of Lipschitz maps Uk converging to v a.e. and enjoying a uniform bound 
Dir(r2, Uk) < C. The Dirichlet energy Dir(r2, v) is the inhmum of all constants C for which 
there is a sequence with the properties above. 

Another possible definition of the Dirichlet energy follows more closely Almgren’s original 
idea: for Q-valued maps we can introduce a notion of differentiability in the following way 

Definition 7.5 (Q-valued differential, cf. [26, Dehnition 1.9]). Let / : D —)■ Aq and 
xo G D. We say that / is differentiable at xq if there exist Q matrices Li satisfying: 

(i) G{f{x),T^J) = o(|a; - Tol), where 

Txofix) := “ ^o) + /i(^o)l ; (24) 

i 

(ii) Li = Lj if fi{xo) = fj{xo). 

The Q-valued map will be called the first-order approximation of / at tq. The point 
Xli 1-^*1 ^ .4 ,q(M"'^™') will be called the differential of / at xq and is denoted by Df{xo). 

A Rademacher’s type theorem shows that Lipschitz Q-valued maps are differentiable 
almost everywhere and that the Dirichlet energy dehned above corresponds to the integral 
of \Df\ = Q{Df,Q |0]) (our notation is consistent, cf. Remark 7.3). The proof is in fact 
a straightforward corollary of Lemma 7.4 and elementary measure theory. 

Proposition 7.6 (Q-valued Rademacher, cf. [26, Theorem 1.13] and [27, Lemma 1.1]). 
Let / : D —)■ Aq he a Lipschitz function. Then, f is differentiable almost everywhere in D 
and 

Dir(/,D)= / \Df\\ 

Jn 

where the left hand side is understood in the sense of (23). In particular the expression in 
(23) is independent of the decomposition given by Lemma l.f. 

Almgren’s dehnition of Sobolev map does not follow, however, a “relaxation procedure” 
but uses the (biLipschitz) embedding of .4 ,q(M”) in a large Euclidean space, see below. 

7.4. and the generalized Dirichlet energy: metric analysis definition. Al¬ 

though the dehnition above is certainly very natural and gives a good geometric intuition 
for the Dirichlet energy, it turns out that it is rather complicated to work with it, in partic¬ 
ular if one wants to recover the usual statements of the Sobolev space theory for classical 
functions. 

Instead, a rather efficient way to achieve such statements is to rely on a more abstract 
dehnition of Dirichlet energy and Sobolev functions, as proposed in [26]. A very general 
theory has been developed in the literature for Sobolev maps taking values in abstract 
metric spaces, following the pioneering works of Ambrosio [6] and Reshetnyak [52, 51]. 
The careful reader will notice, however, that there is a crucial diherence between the 
dehnition of Dirichlet energy in [52] and the one given below. 
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Definition 7.7 (Sobolev Q-valued functions, cf. [26, Definition 0.5]). A measurable / : 
D —)■ Aq is in the Sobolev class (1 < p < cxd) if there exist m functions (pj G M+) 

such that 

(i) a; ^ g{f{x),T) e for all T e Aq] 

(ii) \dj g{f, T)| < (pj a.e. in D for all T G Aq and for all j G {1,..., m}. 

It is not difficult to show the existence of minimal functions pj fulfilling (ii), i.e. such 
that, for any other pj satisfying (ii), pj < pj a.e. (cf. [26, Proposition 4.2]). Such “minimal 
bounds” will be denoted by \djf\ and we note that they are characterized by the following 
property (see again [26, Proposition 4.2]): for every countable dense subset of Aq 

and for every j = 1 ,..., m, 

\djf\ = sup \dj g{f,Ti)\ almost everywhere in D. (25) 

ieN 

We are now ready to give an abstract characterization of the Dirichlet energy. 

Proposition 7.8 (Cf. [26, Proposition 2.17]). If u E is Lipschitz then 

\dju\‘^{x) = \Dui{x) • Cjp (26) 

i 

at a.e. point x of differentiability of u, where \Dui{x)\ is the Q-valued differential in 
the sense of Definition 7.5 and \dju\ is as in (25). 

The main feature of the above proposition is that essentially all the conclusions of the 
usual Sobolev space theory for single valued functions can be now reduced to routine modifi¬ 
cations of the usual arguments: among them we mention Sobolev and Morrey embeddings, 
compact embeddings, Poincare inequalities, semicontinuity results, trace properties (cf. 
[26, Chapter 4]). 

We list here some of these facts and refer to [26] for their proof. 

Definition 7.9 (Trace of Sobolev Q-functions). Let D C be a Lipschitz bounded open 
set and / G Aq). A function g belonging to Aq) is said to be the trace 

of / at dVL (and we denote it by flon) if, for every T G Aq, the trace of the real-valued 
Sobolev function Q{f,T) coincides with Q{g,T). 

Definition 7.10 (Weak convergence). Let fk,f G Aq). We say that fk converges 

weakly to / for /c —)■ oo, (and we write ft f) in Aq), if 

(i) / G{fk, f)^ -t 0, for k ^oo] 

(ii) there exists a constant C such that f \Dfkf < C < oo for every k. 

Proposition 7.11 (Weak sequential closure). Let f G Aq). Then, there is a 

unique function g G LP{dfl, Aq) such that f\dn = g in the sense of Definition 7.9. More¬ 
over, f\dQ = g if and only i/^(/, T)|gn = g{g,T)\dn for every T in the usual sense, and 
the set of mappings 

:= {/ e : f\m = 9 } 

is sequentially weakly closed in 


(27) 
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Proposition 7.12 (Sobolev Embeddings). The following embeddings hold: 

{i) if p < m, then Aq) C for every q G and the inelusion is 

compaet when q < p*; 

{ii) if p = m, then Aq) C L^iVL^Aq), for every q G [1 ,+cxd), with compact 

inclusion; 

{Hi) ifp > m, then W^'^{VL, Aq) C C^’'^{VL,Aq), for a = 1 — with compact inclusion. 

Proposition 7.13 (Poincare inequality). Let M he a connected bounded Lipschitz open 
set of an m-dimensional Riemannian manifold and let p < m. There exists a constant 
C = C{p, m, n, Q, M) with the following property: for every f G W^’P{M, Aq), there exists 
a point f G Aq such that 

Proposition 7.14 (Campanato-Morrey). Let f G W^''^{Bi.,Aq), with Bi C and 
a G (0,1] be such that 


\Df\^<Ar 


m—2-\-2a 


for every y E Bi and a.e. r g] 0, 1 — |i/|[. 


Then, for every 0 < 5 < 1, there is a constant C = C{m,n, Q, 5) with 

_ G{f{x),f{y)) 


sup —^^ 
x,yeBs F y\ 


— ■ [f]c°’°‘(Bs) — C ^/~A. 


Lemma 7.15 (Interpolation Lemma). There is a constant C = C{m,n,Q) with the fol¬ 
lowing property. Let r > 0, g E W^''^{dBr, Aq) and f E W^’^{dBr(i-£), Aq) . Then, there 
exists h E W^’‘^{Br \ Br(i-e),AQ) such that h\ 3 B, = g, = / and 


Dir(/i, Br \ BrQ-e)) <Cer [Dir( 5 (, dB^) + Dir(/, dB^q-e))] + 

+ — / Q{g{x)A'{{^-e)x)f dx. (30) 

JdBr 

7.5. Lipschitz approximation and approximate differentiability. An important fea¬ 
ture of classical Sobolev maps is the existence of suitable smooth approximations. Since 
the space AIq(M") is itself rather singular and lacks any linear structure, the usual ap¬ 
proximation results are indeed much more subtle. However a robust way to approximate 
Sobolev maps is to “truncate them” along the level sets of the Hardy-Littlewood maximal 
function of the modulus of their gradient. This is possible in the setting of Q-valued maps 
as well and will play a crucial role in the sequel. 


Proposition 7.16 (Lipschitz approximation, cf. [26, Proposition 4.4]). There exists a 
constant C = C{m,Q,Q) with the following property. For every f E W^'^{LL, Aq) and 
every A > 0, there exists a Q-function f\ such that Lip (/a) < C A, 
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fl\ Ex can be assumed to contain {x G : M{\Df\) < A}, where M is the maximal 
function operator. 

A simple corollary of the previous proposition is that Sobolev maps are “approximate 
differentiable” in the following sense: 

Definition 7.17 (Approximate Differentiability). A Q-valued function / is approximately 
differentiable in xq if there exists a measurable subset D C D containing xq such that D 
has density 1 at Xq and /|q is differentiable at Xq. 

Corollary 7.18. Any f E Aq) is approximately differentiable a.e. 

7.6. Chain rule formulas: Q-valued calculus. The latter property is very useful to 
extend classical computations like the chain rule to Sobolev maps. Indeed, it is rather 
easy to extend such formulas to Lipschitz maps using the multivalued differentiability: 
Proposition 7.16 can then be used to routinely justify the same formulas for general Sobolev 
maps. 

Consider a function f : Q ^ For every $ : D —)■ D, the right composition / o $ 

dehnes a Q-valued function on D. On the other hand, given a map T : D X —)■ we 

can consider the left composition, x ha fi{x))}, which dehnes a Q-valued function 

denoted, with a slight abuse of notation, by T(a;, /). 

Propositiou 7.19 (Chain rules, cf. [26, Proposition 1.12]). Let / : D —)■ AIq(M"') be 
differentiable at xq. 

{i) Consider <F : D —)■ D such that <h(|/o) = xq and assume that <F is differentiable at 
yo- Then, f o ^ is differentiable at yo and 

D{f o $)(yo) = iDfixo) ■ Dd>(|/o)l. (32) 

i 

{ii) Consider T : x R” —)■ such that T is differentiable at (xq, /i(a^o)) for every i. 

Then, T(a;,/) is differentiable at Xq and 

f)){xo) = lDu^{xo, fi{xo)) ■ Dfi{xo) + D,,^{xo, fi{xo))j . (33) 

i 

7.7. Almgreu’s extriusic maps. The metric Q on AIq(R") is “locally euclidean” at most 
of the points. Consider for instance the model case Q = 2 and a point P = |Pi] + IP 2 ] 
with Pi 7 ^ P 2 . Then, obviously, in a sufficiently small neighborhood of P, the metric space 
Al 2 (R”) is isometric to the Euclidean space R^”^. This fails instead in any neighborhood of 
a point of type P = 2 |Pi]. On the other hand, if we restrict our attention to the closed 
subset {2 |X] : X E R'^}, we obtain the metric structure of R'^. 

A remarkable observation of Almgren is that AIq(R'^) is biLipschitz equivalent to a 
deformation retract of the Euclidean space (cf. [5, Section 1.3]). For a simple presentation 
of this fact we refer the reader to [26, Section 2.1]. 

Theorem 7.20 (Almgren’s embedding and retraction). There exists N = N{Q,n) and an 
injective ^ : AIq(R'^) —)■ R^ such that: 
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(i) Lip(l) < 1; 

(ii) ^fQ = ^{AQ), thenUp{^^^\Q) < C{n,Q). 

Moreover there exists a Lipsehitz map p ; —)■ Q which is the identity on Q. 

In fact much more can be said: the set Q is a cone and a polytope. On each separate 
face of the polytope the metric structure induced by Q is euclidean, essentially for the 
reasons outlined a few paragraphs above (cf. again [5, Section 1.3] or [30, Section 6.1]). A 
simple, yet important, observation of White is that the map can be easily constructed 
so that the Dirichlet energy of ^ o m (as clssical Euclidean map) coincides with that of u 
(as multivalued map) for any u G 

Sobolev maps were initially dehned by Almgren using the map With this artifact 
a lot of the theory outlined in the previous paragraphs can be recovered from the usual 
(single valued) theory using the maps ^ and p. Presently such maps could be avoided for 
essentially all the arguments. However a more complicated version of the map p will play 
a rather important role at a certain point later. As already mentioned, for Q > 1 the space 
^q(M”) is not linear and we cannot regularize Q-valued maps by convolution. Nonetheless 
we will need a way to smooth maps suitably with a procedure which retains some 
of the basic estimates available for convolutions with a standard molliher (in particular 
when computing the energy of the regularizations). A possible approach is to smooth the 
euclidean map ^ou and then “project” it back onto Q using p. However, projecting back 
might be rather costly in terms of the energy since the Lipsehitz constant of p is indeed 
rather far from 1. 

To bypass this problem, we follow Almgren and prove the existence of “almost” pro¬ 
jections, denoted by which are (1 -|- p)-Lipschitz in the ^-neighborhood of ^(PIq(M”)). 
These maps cannot be the identity on Q, but they are at a uniform distance rj from it. 
Almgren’s original proof is rather complicated. In [30, Proposition 6.2] we have proposed 
a different proof which uses heavily Kirszbraun’s extension theorem and seems to yield 
a better estimate of p and rj in terms of 5 (in particular in the version of [30] these are 
suitable positive powers of 5). 

Proposition 7.21 (Almost projection, cf. [30, Proposition 6.2]). For every n,Q G N\{0} 
there are geometric constants So,C > 0 with the following property. For every 5 g ] 0 ,( 5 o [ 
there is ^ Q = ^(^q(M^)) such that \p}{P) - P\ < C for all P e Q 

and, for every u G IP^’^(f2, M-^), the following holds: 

[\D{p*sOu)\^ < + [ \Du\^ + C [ \Du\\ (34) 

8. Dir-MINIMIZERS AND THEIR REGULARITY 

We are now ready to state the main results in the theory of Dir-minimizing maps. In 
what follows, H is always assumed to be a bounded open set with a sufficiently regular 
boundary. 



32 


CAMILLO DE LELLIS 


Theorem 8.1 (Existence for the Dirichlet Problem, cf. [26, Theorem 0.8]). Let g G 
.Aq). Then there exists a Dii-minimizing f G A.q) sueh that /Iqq = g\dn- 

Theorem 8.2 (Holder regularity, cf. [26, Theorem 0.9]). There is a positive eonstant 
a = a{m,Q) with the following property. If f E W^''^{VL]Aq) is D've-minimizing, then 
f G C°’“(r2') for every hi' CC H C M™. For two-dimensional domains, we have the explieit 
constant a{2, Q) = 1/Q. 

For the second regularity theorem we need the dehnition of the singular set of /. 

Definition 8.3 (Regular and singular points, cf. [26, Dehnition 0.10]). A Dir-minimizing 
/ is regular at a point a; G D if there exists a neighborhood B of x and Q analytic functions 
/j : R —)■ M” such that 

fiy) = ^Ifiiy)} for every i/GR (35) 

i 

and either fi{y) ^ fj{y) for every y E B, or fi = fj. The singular set Sing(/) is the 
complement of the set of regular points. 

Theorem 8.4 (Estimate of the singular set, cf. [26, Theorem 0.11]). Let f be Dir- 
minimizing. Then, the singular set Sing(/) is relatively closed in D. Moreover, if m = 2, 
then Sing(/) is at most countable, and ifm>3, then the Hausdorff dimension o/Sing(/) 
is at most m — 2. 

Note in particular the striking similarity between the estimate of the size of the singular 
set in the case of multiple valued Dir-minimizers and in that of area minimizing currents. It 
will be discussed later that, even in the case of Dir-minimizers, there are singular solutions 
(which are no better than Holder continuous). 

Complete and self-contained proofs of these theorems can be found in [26]. The key tool 
for the estimate of the dimension of the singular set is the celebrated frequency function 
(cp. with [26, Section 3.4]), which has been indeed used in a variety of different contexts in 
the theory of unique continuation of elliptic partial differential equations (see for instance 
the papers [44], [45]). This is the central tool of our proofs as well. However, our arguments 
manage much more efficiently the technical intricacies of the problem and some aspects of 
the theory are developed in further details. For instance, we present in [26, Section 3.1] the 
Euler-Lagrange conditions derived from first variations in a rather general form. This is 
to our knowledge the first time that these conditions appear somewhere in this generality. 

Largely following ideas of [13] and of White, we improved the second regularity theorem 
to the following optimal statement for planar maps. 

Theorem 8.5 (Improved 2-dimensional estimate, cf. [26, Theorem 0.12]). Let f be Dir- 
minimizing and m = 2. Then Sing(/) is discrete. 

This result was announced in [13]. However, to our knowledge the proof has never 
appeared before [26]. 

A new addition to the regularity theory, which will have a lot of importance in the 
subsequent discussions, is the following higher integrability result. 
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Theorem 8.6 (Higher integrability of Dir-minimizers, cf. [30, Theorem 5.1]). Let CC 
H CC M™' be open domains. Then, there exist p > 2 and C > 0 sueh that 

W^'^WiPin') — ^ VDir-mmim. m G .4,q(M"')). (36) 

We believe that several intricate arguments and complicated constructions in the third 
chapter of Almgren’s monograph can be reinterpreted as rather particular cases of this key 
observation (see for instance [5, Section 3.20]). Surprisingly, this higher integrability can 
be proved in a very simple way by deriving a suitable reverse Holder inequality and using 
a (nowadays) very standard version of the classical Gehring’s Lemma. 

Theorem 8.6 has been stated and proved for the hrst time in [30]. The relevant reverse 
Holder inequality has been derived using a comparison argument and hence relying heavily 
on the minimality of the Dir-minimizers. A second proof, exploiting the Euler-Lagrange 
conditions to give a Caccioppoli-type inequality, has been given in [63]. This last proof still 
uses the regularity theory for Dir-minimizers. However, this occurs only at one step: one 
could hope to remove this restriction and generalize the higher integrability to “critical” 
points of the Dirichlet energy. 

In [63] a yet different proof for the planar case is proposed, yielding the optimal range of 
exponents p for which (36) holds. The optimality of this result, as well as the optimality 
of Theorems 8.2 and 8.5, is shown by another remarkable observation of Almgren. Besides 
giving area minimizing currents, holomorphic varieties are locally graphs of Dir minimizing 
Q-valued functions. In [5, Section 2.20] Almgren proves this statement appealing to his 
powerful approximation results for area minimizing current. However this is unnecessary 
and a rather elementary proof can be found in [63]. 


8.1. Holder continuity of Dir-minimizers. The entire Section 9 will be dedicated to 
the proof of Theorem 8.4, since it contains, in a simplihed setting, several of the themes of 
the proof of Theorem 3.4. 

In this paragraph we will instead discuss briefly the ideas behind the proof of Theorem 

8.2. We hrst assume that m is a classical (single valued) function and, for simplicity, that 
m > 3 (the case m = 2 is somewhat special and can be handled in a simpler way). Let 
u G IT^’^(i?i,.4 .q(M"')) be a minimizer and compare its energy to the energy of the 0- 
homogeneous extension v of its trace on dBi{x): we achieve the following crude inequality 




\Du\‘^ < 


'Biix) 


\Dv\^ < 



(37) 


A scaling-invariant version of the above inequality can be combined with Fubini’s theorem 
to give the following differential inequality: 


/ \Du\^ < — 

'Br(x) ^ 


d 


2 dr 


'Br{x) 


\Dn\- 


(38) 


which in turn gives the bound Dir(M, H^(a;)) < Cr"^~‘^. If we could improve the constant 

in (38) to ^_ 2 _)_ 2 e ; the same reasoning would give the estimate Dir(M, Hj.(a;)) < 

and Proposition 7.14 would imply the e-Holder continuity of u. Now, for a single valued 
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function u the first inequality in (37) is certainly strict, since v does not satisfy the Euler- 
Lagrange conditions of a minimizer. It is not difficult to see that the very same conclusion 
can be drawn in the multivalued setting. The problem is to gain, in the factor of the 
right hand side of (37), a constant e > 0 which is independent of the function (and, more 
importantly, of the central point x). 

We can therefore focus on improving the constant in the right hand side of (37) and 
without loss of generality we can assume x = 0. It is easy to see that we can assume, again 
without loss of generality, that the Dirichlet energy in i?i(0) is normalized to 1. When u is 
single valued we can also assume that u has average 0 after subtracting a second suitable 
constant: the “uniform gain” from to ^_ 2 + 2 e then a simple consequence of 

the standard compactness of Sobolev maps (via Poincare inequality). However, the multi¬ 
valued Poincare inequality (namely Proposition 7.13) does not give the same normalization 
property. The only well dehned operation is the subtraction of the same value p from all 
Q sheets, namely given u = |nj] we can set 

v{x) = lui{x) -pI . 

i=l 

But the “generalized mean” of Proposition 7.13 is just a generic point in .4 .q(M”). In 
particular we cannot expect compactness when we only control the Dirichlet energy: for a 
general Sobolev map some sheets might be very far apart on a large subset and be very close 
on another, very small, subset. However, it can be shown that if the average separation 
between some sheets of a Dir-minimizer v is too large compared to its Dirichlet energy on 
a given domain, then v must split into simpler functions in a smaller domain. This allows 
to prove that there is a uniform gain in the constant of the right hand side of the inequality 
of (37). The gain will depend upon Q, but this is not an artifact of the proof: as already 
pointed ont, the Holder exponent in Theorem 8.2 does deteriorate to 0 as Q —)■ cxd. 

The above discussion snggests that, although a control on the Dirichlet energy is not 
enough to ensure compactness for a general seqnence of Q-valned functions, a suitable 
form of compactness can be recovered after modifying the sequence on a small set. This 
principle plays a key role in our future discussions and has been formalized in [30] in the 
following way. 

Definition 8.7 (Translating sheets, cf. [30, Dehnition 3.1]). Let D C M™' be a bounded 
open set. A seqnence of maps {hkjksN C IP^’^(D, .4,q(M”)) is called a sequence of translat¬ 
ing sheets if there are: 

(a) integers J > 1 and Qi, • • •, Qj > 1 satisfying Qj = 

(b) vectors yi G M” (for j G {1,..., J} and /c G N) with 

\iin\yl-yl\ =+00 Vi ^ j, 

k 

(c) and maps G IP^’^(D, Aq.) for j G {1,..., J}, 
such that hi = Ej.iIO “ rfl' 


( 39 ) 
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Proposition 8.8 (Concentration compactness, cf. [30, Proposition 3.3]). Let C 
be a Lipschitz bounded open set and {gk)keN C .Ag) a sequence of functions with 

supi. J^\Dgk\^ < oo. Then, there exist a subsequence (not relabeled) and a sequence of 
translating sheets hk such that \\Q{gkihk)\\i ^2 —)■ 0 and the following inequalities hold for 
every open Q' C Q and any sequence of measurable sets Jk with \Jk\ —)■ 0.' 

liminf^ [ \Dgk\‘^ - f \Dhk\A >0 (40) 

fc^>+oo Jq/ J 

limsup f {\Dgk\ - \Dhk\f < limsup f {\Dgk\^ - \Dhk\‘^) • (41) 

fc^-l-oo Ju k Jq 


9. The frequency function 

In this section we review the ideas behind the proof of Theorem 8.4. As already men¬ 
tioned the argument will serve as a prototype for the argument of Theorem 3.4 and for 
this reason we will be quite detailed. 


9.1. First variations. There are two natural types of variations that can be used to 
perturb Dir-minimizing Q-valued functions. The hrst ones, which we call inner variations, 
are generated by right compositions with diffeomorphisms of the domain. The second, 
which we call outer variations, correspond to “left compositions”. More precisely, let / be 
a Dir-minimizing Q-valued map. 

(IV) Given (p e C)P(r2,M”^), for e sufficiently small, x i-)- ^^(a;) = x + e(p{x) is a diffeo- 
morphism of D which leaves dQ hxed. Therefore, 


0 


d 

ds 


£■=0 





(42) 


(OV) Given tjj G C°°(r2 x M"',R”) such that spt('^) C D' x R” for some D' CC D, we set 
= Ei and derive 


0 


d 

de 



(43) 


Using the multivalued chain rules we can turn the conditions (42) and (43) into the following 
identities: 


Proposition 9.1 (First variations, cf. [26, Proposition 3.1]). Let f : Vt ^ Mq(R"') be 
Dii-minimizing. For every (p G C“(r2,R™'), we have 

2 j '^{Df\-Df\-Dip) - j \Dffdwip = t). (44) 

For every G x R[];,R”) such that 

spt('^) C D' X R” for some D' CC D, 


Du'ip\ < C < oo and |'^| -|- {Dx'ipl < C (1 -|- |m|) , 


and 


(45) 
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we have 


j ^ {Dfi{.x) : fi{x))) dx + j {Dfi{x) : Du'ip^x, fi{x)) ■ Dfi{x)) dx 


0 . 

(46) 


9.2. The monotonicity of the frequency function. (44) and (45) give particularly 
interesting identities when tested with functions which depend on |a;|. In what follows, u 
will always denote the outer unit normal on the boundary dB of a given ball. The following 
proposition gives the relevant identities when we test with the singular functions ifly) = 
lBr{x){y)y and 'ip{x,u) = ulBr{x){y) (the proof follows from a standard regularization of 
these ip and 'ip). 


Proposition 9.2 (cf. [26, Proposition 3.1]). Let x E and / : ^ .4,q(M") be Dir- 

minimizing. Then, for a.e. 0 < r < dist(a;, (9f2), we have 

(m - 2) / \Df\^ = T [ \Df\^ -2r [ Y (47) 

J Br{x) JdBr{x) JdBr{x) ^ 

f \ DW = [ (48) 

J Br(x) JdBr(x) ^ 

We next introduce Almgren’s frequency function and state his celebrated monotonicity 
estimate, which is a straightforward consequence of the identities (47) and (48). Recall the 
notation |/| for the function G{f,Q |0]). 


Definition 9.3 (The frequency function, cf. [26, Definition 3.13]). Let / be a Dir- 
minimizing function, x E Ll and 0 < r < dist(a;, dfl). We dehne the functions 



Hxj{r) = [ I/P and 4j(r) 

JdBr 


rP^.Ar) 
Wj{r) ' 


(49) 


Ix,f is called the frequeney function. 


When X and / are clear from the context, we will often use the shorthand notation D{r), 
H{r) and I{r). 


Theorem 9.4 (Monotonicity of the frequency function, cf. [26, Theorem 3.15]). Let f be 
Dir-minimizing and x E Ll. Either there exists > 0 such that f\Bi,{x) = 0 or Ixjir) is 
an absolutely continuous nondecreasing positive function on ]0, dist(a;, (911) [. This function 
takes a constant value a if and only if f{y) is a-homogeneous in y — x. 


This monotonicity is the main ingredient in the proof of both Theorems 3.4 and 8.4. 
An important observation, which was hrst made in [29], is that the frequency function 
can be thought as a “singular limit” of smoother objects, i.e. of regularized frequency 
functions, which are also monotone. This simple remark (which is not present in Almgren’s 
monograph) gives an important advantage: the regularized frequency functions enjoy better 
continuity properties in terms of /. 
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Definition 9.5 (Regularized requency functions). Assume 0 is a Lipschitz nonnegative 
nonincreasing compactly supported function on [0,1[ which is constant and positive in a 
neighborhood of 0 and define 


Do,/(r) 

Ho,/(r) 

Io,/(r) 


;= J 0 ^—^ \Df\‘^{x)dx 



r'Dpjjr) 

Ho,/(r) 


Theorem 9.6. Let f be Dir-minimizing and 0 G ff. Either there exists g such that /|sj,(o) = 
0 or Io,/(r) is an absolutely continuous nondecreasing positive function on ]0, dist(a;, (9f2)[. 
This function takes a constant value a if and only if f{x) is a-homogeneous in x. 


We do not have a reference for the latter theorem, which follows from a straightforward 
adaption of the arguments used in the proof of Theorem 15.8. A special case of Theorem 
9.6, namely for a special choice of the cut-off 0, is hidden in the computations of [29, 
Theorem 3.2] (cf. in particular [29, Eq. (3.13)]). 


9.3. The two fundamental consequences of the monotonicity formnla. Theorem 
15.8 has two crucial consequences, when “blowing-up” a given Dir-minimizing function. 
More precisely, consider a Dir-minimizing / taking Q > 1 values and a point p in its 
domain. Without loss of generality we can assume that p = 0. If the support of /(O) 
contains two different points, then, by continuity, in a neighborhood U of 0 f splits into 
two separate functions ui and U 2 which are both and continuous. It is simple to see 
that both must be minimizers of the Dirichlet energy in U. 0 is then a good point, where 
we have reduced the complexity of the problem. For instance, if Q were 2 we would know 
that Ui and U 2 are two classical (single valued) harmonic functions. The “problematic 
points” are then those p where f{p) = Q |g]. 

We can therefore assume that /(O) = Q |g] for some q G M”. Now, according to our 
definition of the singular set Sing(/), we have two possibilities: 

(a) / equals Q copies of a classical harmonic function in a neighborhood of 0; 

(b) 0 is a singular point for /. 

In general, an interesting object to look at is the average of the sheets of / = [/*!) 

namely A f^. For this average we fix the notation f] o f. It is not difficult to see that 
r/ o / is a classical harmonic function. Indeed, if we define 

f fl , 

i 

it is immediate to see that Dir(/) = Dir(/) -|- QDir{r] of). In particular it is not difficult 
to conclude that / is also a Dir-minimizer, cf. [26, Lemma 3.23]. Looking at the latter 
function we can thus restate the alternative as: either / = Q | 0 ] in a neighborhood of the 
origin, or 0 is a singular point for / (and thus a singular point of /!). 
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The discussion above leads to the consideration that, without loss of generality, we can 
assume r} o f = 0. Assume further that the (more interesting!) alternative (b) above 
holds. Then / does not vanish identically and therefore both DQj{r) and Hoj{r) are 
positive for some r. Using Theorem 8.2 it is not difficult to see that, under the assumption 
/(O) = Q |0], we have a uniform bound of the form 


HoAr) < CrDoAr) 


Vr G 


Q dist(0,5ri) 

2 ’ 


(50) 


where the constant C is independent of /. The obvious consequence of Theorem 15.8 is 
that there is also a reverse control 


rDoAr) < CHoAr) (51) 

although the latter constant C depends upon the point (0 in this case) and the function 
/. Indeed such constant approaches, for r 0, the limit /o(/) := -fo,p(/), which by 

(50) is bounded away from 0 and by Theorem 15.8 is hnite: on the other hand we have 
no explicit (neither universal!) upper bound: we insist that /o(/) depends upon / and the 
particular point (0 in this case) where we are “blowing-up”. 


Consider now the rescaled functions fo,r{x) '■= f{rx) and their renormalized versions 


UoAA 


fo,r 

Dir(/o,.,5i)V2- 


In particular the energy of Wo.r is 1. However the norm of |no,r| is also under control 
because of (50). We then have compactness for the family {no,r}r- Fix a map u which is 
the limit of any subsequence uo^r^ with 0. It is not difficult to see that a sequence of 
minimizers with such uniform controls converge strongly in lU^’^ in any compact subset: 
namely the Dirichlet energy of the limiting function is the limit of the Dirichlet energy of 
the corresponding functions on any subdomain H which is compactly contained in i?i(0), 
cf. [26, Proposition 3.20]. However the minimizing property alone does not guarantee 
strong convergence on the whole domain Hi(0). 

To understand the latter statement, consider for instance the planar (single valued!) 
harmonic functions 

fk{xi,X 2 ) = Re {xi + ix^f 

and their normalizations 

Uk := /fc/Dir(/fc,Ri(0)) . 

It is very elementary to see that Uk converges to 0 in i?i(0): in fact most of the Dirichlet 
energy of Uk lies in a thin layer around the boundary (9i?i(0). For k large the layer becomes 
thinner and thinner and all the energy is “pushed” towards the boundary dBi{0). On the 
other hand it is easy to see that the ratio 


Do,ukiX) ^ 1 

Ho^UkA) 

explodes, namely that the norm of Uk on dBi{0) converges to 0. 
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This highlights the hrst important consequence of the frequency function: the “reverse 
Poincare” inequality (51) excludes that the energy of Uo,r concentrates towards the bound¬ 
ary. Any limit h of a sequence Mo,rfc uiust therefore have energy equal to 1. Since Theo¬ 
rem 8.2 guarantees uniform convergence, we also conclude that 11(0) = Q |0]. Moreover, 
r] o u = 0 because r] o uo,r = 0 . 

Thus 0 must be a singular point of u as well; the only way it could be regular around 0 
would be to take the value Q |0] identically in a neighborhood of 0. However notice that 
lofii'f') = -^o,/(0) =: a for every r. But then Theorem 15.8 implies that u is a-homogeneous, 
and if u would vanish in a neighborhood of 0 , then it would vanish on the entire ball i?i( 0 ), 
contradicting the fact that the Dirichlet energy of u is indeed 1. 

The conclusion is that the singularity has persisted in the limit. Recalling that our main 
concern in proving Theorem 3.4 was the disappearance of singular points along sequences 
of converging currents, the reader will understand why the monotonicity of the frequency 
function is such an exciting discovery. It must also be noticed that the monotonicity of 
the frequency function was unknown even for classical single valued harmonic functions 
before [5]: the shear observation that Almgren was able to discover a new fundamental 
fact for classical harmonic functions around 1970 gives in my opinion the true measure of 
his genius. 

The second fundamental consequence of the monotonicity of the frequency function is 
that Io,u{r) is indeed constant in r and equals a := /o,/(0), which, as already noticed, gives 
that We have u is a-homogeneous. In particular when the domain is 2-dimensional, it 
is not difficult to classify all a-homogeneous Dir-minimizers and to show that their only 
singularity is at the origin, cf. [26, Proposition 5.1]. 

The careful reader will recognize the formal analogy with the two ingredients of Federer’s 
reduction argument illustrated in Section 4.5: pretty much the same reasoning gives the 
proof of Theorem 8.4. There is however one important difference: for a Q-valued minimizer 
/ on a 2-dimensional connected domain we do not conclude the discreteness of Sing(/), 
but rather the weaker statement that 

• either “multiplicity Q points” of / are isolated; 

• or / collapses to Q {rj o fj. 

Only in the case Q = 2 the statement above is equivalent to discreteness of the singular set 
of /. When, for instance, Q = 3,we have not ruled out that singular points with “2-sheeted 
branching” could converge towards a singular point with a “3-sheeted” branching. 

Thus, the argument sketched above gives, in the 2-dimensional case, that Sing(/) is 
countable, but it does not imply its discreteness. The proof of Theorem 8.5 needs much 
more work and in particular it passes through the important additional conclusion that 
the tangent functions u analyzed above are unique, namely the renormalized blown-up 
functions Mo,r have a unique limit as r 0, cf. [26, Theorem 5.3]. At present this uniqueness 
is an open problem when the dimension of the domain is higher than 2 . 
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10. Approximation with multiple valued graphs 

Following the intuition that a “sufficiently flat” area minimizing current is close to the 
graph of a Dir-minimizing multivalued function, we wish now to use the theory above to 
infer some interesting informations upon area minimizing currents in a region where they 
are rather flat, i.e. the tangent planes are almost parallel to a given one (at least in an 
average sense). For this reason we introduce the notion of cylindrical excess. We will denote 
by Cr{p) the cylinder Br{x) x R” when p = [x, y) G R™ x R”. In fact in the future we wish 
to consider cylinders with bases parallel to different m-dimensional planes: having fixed an 
m-dimensional plane n, we set Br{p,n) := Br(p) fl (p + tt) and Cr{p,7i) = Br{p,n) + 

The notation ttq will be reserved for the “horizontal plane” R"^ x {0} and we will use 
and for the orthogonal projections onto n and tt"*-. 

Definition 10.1 (Cylindrical excess). Given an integer rectifiable m-dimensional current 
T in R™+^ with finite mass and compact support and m-planes tt, tt', we define the excess 
of T in the cylinder Cr{x,7i) compared to n' as 

E(T,a(a;,7r),7r'):=(2a;^r"^)-' [ |f - Trf d||T|| . (52) 

J Cr (ai,7r) 

If TT = tt', then we write E(T, Cr(a;, tt)). 

The height function in a set A C R™'+™- with respect to an m-dimensional plane tt is 
h(T,A,7 r):= sup |p^±(a;) - p^±(i/)| . 

x,y S spt(T) n A 

10.1. Mnltivalued push-for wards, graphs, the area formula and the Taylor ex¬ 
pansion of the mass. One first technical detail that we have to tackle concerns the 
currents which are naturally induced by multivalued maps. Assume therefore to have fixed 
a Lipschitz map F : R™' D O —)■ A.q(R™'’'"”) on a bounded open set O. Consider the regions 
Mi and the functions // of Lemma 7.4. Through them we can define the “multivalued” 
pushforward 

tf := [Ml. 

i 

In a similar fashion we can define multivalued pushforwards when the domain O is a 
Riemannian manifold with finite volume. Moreover the current naturally carried by the 
graph of a multivalued function u can be defined using the pushforward through the map 
X i-A |(a;, Mj(a;))]. The corresponding current will be denoted by G„, whereas for the 
set-theoretic objects we will use the notation Im(F) and Gt{u). 

The currents introduced above are well-defined (namely they are independent of the 
decomposition chosen in Lemma 7.4) and in fact the assumption of boundedness of 
and flniteness of the volume of M can be removed if F satisfies a suitable “properness” 
assumption, cf. [27, Definition 1.2 & Definition 1.3]. Moreover, the usual formulas and 
conclusions valid in the classical-valued setting holds in the multivalued case as well. We 
record here some important conclusions. 
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Lemma 10.2 (Bilipschitz invariance, cf. [27, Lemma 1.8]). Let F : E —)■ .4.q(M"') be a 
Lipschitz and proper map, $ : S' —)■ S a bilipschitz homeomorphism and G := Fo<|>. Then, 
Ti. = Tg. 

Lemma 10.3 (Q-valued area formula, cf. [27, Lemma 1.9]). Let S C be a Lipschitz 
oriented submanifold, M <zT a measurable subset and F : M ^ ^q(M"') a proper Lipschitz 
map. For any bounded Borel function h ■.'ML ^ [0, cxd[, we have 

hip)d\\TF\\ip) < [ V/i(F^'(a;)) JF^(a;)d77”'(a;), (53) 

Jm ^ 

where 

iF^{x) = \DF^{x)^e\ = ^dei{{DF3{x)Y ■ DFi{x)) 

Equality holds in (53) if there is a set M' C M of full measure for which 

{DF^{x)Y{x)^ — 0 ^^^y ^ andi,j with F\x) = F^{y). (54) 

If (54) holds the formula is valid also for bounded leal-valued Borel h with compact support. 

Corollary 10.4 (Area formula for Q-graphs, cf. [27, Corollary 1.11]). Let S = M'”, 
M C and / : S —)■ ^q(M"') be a proper Lipschitz map. Then, for any bounded 
compactly supported Borel h : —)■ M, we have 

f mi 

h{p)d\\G^f\\{p)= j '^h{x,fi{x))(^l + Y^ {detAfydx. (55) 

i k=l AeM'^iDp) 

Theorem 10.5 (Boundary of the push-forward, cf. [27, Theorem 2.1]). Let T be a Lipschitz 
submanifold ofM^ with Lipschitz boundary, F : E —)■ .4,q(M'') a proper Lipschitz function 
and f = F|as. Then, dTp = T/. 

One crucial point in our discussions is the Taylor expansion of the mass of a multivalued 
graph. 



Corollary 10.6 (Expansion of M(G/), cf. [27, Corollary 3.3]). Assume O C is an 
open set with bounded measure and f : Q ^ .4,q(M”) a Lipschitz map with Lip(/) < c. 
Then, 

M(G;) = QVI A- j \DW + / E . ( 56 ) 

^ JO I 

where R 4 E satisfies |F 4 (F)| = \DfL{D) for L with Lip(L) < C and Z(0) = 0. 


10.2. The main approximation theorem. If T is an integral current without boundary 
in Cr(0), a Lipschitz u : F,,(0) —)■ .4.q(M"') is an efficient approximation if M(T — G^) is 
small (compared to r”^). Since G^, is, in a “loose” sense a Q-fold cover of F,,(0), namely 
(P 7 ro)t|Gn = Q |Fr(0)], this condition must hold for a well-approximated current T as well. 
We are now ready to summarize the main assumptions under which we wish to hnd a good 
Lipschitz multivalued approximation, which is achieved in Theorem 10.8. To simplify our 
notation p^^ and p;]:^ will be denoted by p and p-*“. 
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Assumption 10 . 7 . S C is a submanifold of dimension m + n = m + n — l, which 

is the graph of an entire function 'h : —)■ M! and satishes the bounds 

||^'h||o<Co and A ;= ||As||o < Cq, (57) 

where cq is a positive (small) dimensional constant. T is an integral current of dimension 
m with bounded support contained in S and which, for some open cylinder (with 

r < 1) and some positive integer Q, satishes 

PjjTLC 4 r(a;) = Q |i? 4 r(a;)] and dT\_Cir{x) = 0. (58) 

Theorem 10.8 (Strong approximation, cf. [30, Theorem 1.4]). There exist eonstants 
C, 7 i,ei > 0 (depending on m,n,n,Q) with the following property. Assume that T is area 
minimizing, satisfies Assumption 10.7 in the eylinder and E = E(T, C 4 r(a;)) < ei. 

Then, there is a map f : Bfix) —)■ Aq(M”), with Gr(/) C S, and a elosed set K C Br{x) 
such that 


Lip(/) < + CAr, 

G/L(AxR”) =TL(AxM'^) and \Br{x)\ K\ < C {E + r"^ A^) r^, 


(59) 

(60) 


||T||(a,(a;))-gcn^(ar)’" 



< CE^^ {E + r‘^A^) r™ 


VO < a < 1. 

(61) 


If in addition h(T, C4r(a;),7ro) < r, then 

osc (/) < C'h(T, Cir{x),Tio) + C{E^/^ + r A) r , (62) 

where osc (/) := sup{|p — q\ : p E spt(/(a;)), q G spt(/(?/)), x, y G Bfix)}. 


We note that the theorem is scaling invariant and thus it suffices to prove it in the case 
r = 1. Moreover, for simplicity we will mostly ignore S and often assume that T is area 
minimizing in the whole euclidean space: this will be of great help in illustrating the main 
ideas behind the proof, avoiding some technicalities. 

An elementary computation shows that, under Assumption 10.7, 


E(T, a(a;)) 


||T||(C.(a;)) 


-Q. 


It is then natural to introduce the following “excess measure”: 


Definition 10.9 (Excess measure, cf. [30, Dehnition 1.2]). For a current T as in Assump¬ 
tion 10.7 we dehne the excess measure Gt and its density d'r: 


eriA) := UTlKAxR’^) 

^riy) ■= limsup 


-Q\A\ 

eriBsiy)) 


5^0 


COri 


for every Borel A C Bfix), 
= hmsupE(T, 0^(1/)). 
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10.3. BV estimate for slices and first approximation. It is rather clear that the 
smallness of the cylindrical excess prevents the tangent plane to T at p to have negative 
intersection with {p(p)} x M"" at most points p in spt(T). In fact this is a simple measure- 
theoretic fact: even without assuming that T is area minimizing, it remains true that, 
under Assumption 10.7, most slices (T, p, p) will be elements of .4,q(M"'). The points y 
which violate the latter property will form a set of small measure. 

It is instructive to see what happens if Q = 1 and T is assumed to be a-priori the graph 
of a classical map v, assuming a Lipschitz bound like Lip(u) < 1. The cylindrical excess E 
is then comparable, up to constants, to the norm of Dv. It is a classical statement for 
a (single valued) Sobolev map that a Lipschitz control holds on the restriction of the map 
on a fairly large closed set, cf. for instance [37, Section 6.6.3]. Indeed a way to identify a 
good set on which such Lipschitz bound holds is to look at those points where the Hardy- 
Littlewood maximal function of \Dv\ is suitably small. Under our idealized situation, \Dv\^ 
is comparable to the excess density introduced above. This motivates the introduction 
of a maximal function in our setting 

Definition 10.10 (Maximal function of the excess measure, cf. [30, Definition 2.1]). Given 
a current T as in Assumption 10.7 we introduce the “non-centered” maximal function of 
ex'- 

mer(|/) := sup ^ E(T, C,(u;)). 

y&Bs(w)(lB4r{x) y&Bs{w)(lB4r(x) 

Going on with our Sobolev space analogy, if we denote by E the square of the norm 
of I Du I (normalized by r™) and we let K be the set where the maximal function of |Dup 
lies below the threshold then the restriction of u to 77 will have Lipschitz constant 

D'’'! and the size of the complement of K can be estimated with Of course we 

can then extend v\k outside 77 to a Lipschitz function with essentially the same Lipschitz 
bound. Neglecting the effect of T, it is then clear that, only relying on Assumption 10.7 
we can hope for estimate (59) if we replace the superlinear in (60) and (61) with, 

respectively, 77^“^^ and E. 

This heuristic discussion can be in fact made rigorous in a very direct way relying on 
some recent developments in geometric measure theory. Regarding the slicing map (T, p, •) 
as a map taking values into the space of 0-dimensional currents (endowed with a suitable 
metric) and using the formalism introduced by Ambrosio in [6] for BV maps with metric 
targets, Jerrard and Soner have given in [48] a rather elementary way to prove that such 
map is a function of bounded variation, with norm which can be controlled with the mass 
of T and the mass of its boundary. Ambrosio and Kirchheim used then this idea in [9] to 
develop part of their general theory of metric currents and give a rather efficient and general 
approach to the Federer-Fleming compactness theorem. The resulting computations must 
be suitably adjusted to our setting. However the theory allows a quite direct proof of the 
following 

Proposition 10.11 (Lipschitz approximation, cf. [30, Proposition 2.2]). There exists a 
constant C > 0 with the following property. Let T and T be as in Assumption 10.7 in the 
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cylinder 045 ( 0 ;). Set E = E(T, 045 ( 0 ;)), let 0 < < 1 6 e such that < ^n, and 

define 

K := {meT < (5ii} fl ^ 35 ( 0 ;) . 

Then, there is u E Lip(i? 35 (o;), v4,q(M”)) such that Gy{u) C S and 

Lip(M) < + ||£>'I'||o), osc(m) < Oh (T, 045 ( 0 ;), TTo) + Os||0>^||o, 

Gu\-{K X M”) = Tl{K X M”), 

10 ™ / \ 

\Br{x)\K\<— — eTi{meT>2 fl i?r+ros(2^)) Vr<3s, (63) 

dll V / 

where tq = 16 ^E/du < 1 . 

From Proposition 10.11 one derives immediately a version of Theorem 10.8 where the 
bound (59) is correct, whereas in the bound (60) the factor must be replaced by 

and in the bound (61) must be replaced by E. In the rest of this section 

we will discuss why the area minimizing assumption, which so far we have not yet used, 
allows to improve the bounds to achieve Theorem 10.8. 


10.4. Superlinear gain. Going back to our heuristic idea, in which T is replaced by 
the graph of a single valued function v and the excess by the square of the Lfi norm, 
the “maximal function truncation” described in the previous paragraph would deliver the 
desired superlinear estimates if we knew that the norm of Dv were controlled by E^^^, 
namely the norm of Dv, for some fi > 0. This amounts to a reverse Holder inequality 
of the form 

Ilf^'dlUp < C'||-D'd||L 2 for some p > 2 . (64) 

In our setting one possible translation would be: the excess measure is absolutely 
continuous and its density dy enjoys the estimate 

||dT||Li+®(B 2 r(a;)) — C'eT(i?4r(a;)) < Cr^E . (65) 

This is certainly not correct under the only Assumption 10.7: it is clear that in order to 
hope for such a bound we need to use the hypothesis that T is area minimizing. We do not 
know whether (65) is correct under the additional assumption that T be area minimizing: 
even if it is, we expect that its proof is rather difficult, see the discussion below. However, 
the cornerstone to our approach to Theorem 10.8 is that the following slightly weaker form 
of (65) is correct and can be achieved with a moderate effort. 


Theorem 10.12 (Gradient estimate, cf. [30, Theorem 1.3]). There exist constants 
Pi > 1 and G, £10 > 0 (depending on m,n,n,Q) with the following property. Let T he as 
in Assumption 10.7 in the cylinder C4. IfT is area minimizing and E = E(T, C4) < £10, 
then 




THE SINGULAR SET OF AREA-MINIMIZING GURRENTS 


45 


From Theorem 10.12 and Proposition 10.11 we cannot conclnde directly Theorem 10.8 
because we lack control on the set where d'r is rather high (and on the singular part of the 
measure ey!). We would rather need an estimate for the regions where the tangent to T 
has high slope, compared to ttq. Theorem 10.12 can be indeed used to prove something of 
that kind: 

Theorem 10.13 (Almgren’s strong excess estimate, cf. [30, Theorem 6.1]). There are 
constants £ 11 , 711 , C > 0 (depending on m,n,n,Q) with the following property. Assume T 
satisfies Assumption 10.7 in C4 and is area minimizing. If E = E(T, C4) < £ 11 , then 

MA) < C + |Ap“) (E + A^) for every Borel A C B 9 . (67) 

Actually, in the case of a classical single valued map Theorem 10.13 could be concluded 
directly by comparing the mass of the current T with that of a suitable convolution of 
the approximating Lipschitz map. The effect of the convolution is to smear high gradients 
and show that they are energetically not favorable. A surrogate of this regularization 
process can be achieved using the map p* of Proposition 7.21; to regularize a map u we 
can hrst regularize ^ o n by convolution and then gain a multivalued map composing the 
regularization with o p*. We can compare the Dirichlet energy of this map to that of 
the regularization of ^ o n. Note however that if we want a small error we have to choose 
the parameter 6 in Proposition 7.21 small and yet hope that the second integrand in the 
right hand side of (34) remains small. Theorem 10.12 allows to control that dangerous 
term. 


10.5. Higher integrability and harmonicity. Going back to our analogy, we know 
that if T were the graph of a function, the minimality assumption and the smallness of the 
excess should imply that v is close to an harmonic function. Of course for single valued 
harmonic functions then the reverse Holder inequality (64) is true for any exponent p > 2. 
On the other hand we already discussed that, for a suitable choice of p, the same reverse 
Holder inequality does hold in the multivalued setting as well, cf. Theorem 10.13. This 
suggests that in order to prove Theorem 10.12 we could hrst show that the Lipschitz map of 
Proposition 10.11 is almost Dir-minimizing. Looking at Theorem 10.8 it is rather intuitive 
that the “almost Dir-minimality” of / should correspond to have a o{E) in place of 
in (61), where o{E) is any function of E which vanishes faster than E at 0. Now, using an 
energetic comparison, such a gain would correspond to show that 


'Br{x)\K 


\Df\^ 


= r^o{E). 


If this were not true we could run a contradiction argument over a sequence of currents Tk 
with vanishing excess Ek and look at the normalized approximations Uk '■= fk/E^^^. We 
could also rescale the corresponding balls to have radius 1 and center 0. The m-dimensional 
volume of the corresponding bad sets Bi \ Kk is converging to 0 and in spite of that 


lim inf 

k 



> rj 
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for some positive rj. If we assume that Uk is converging in to some u, the Dirichlet 
energy of u would then satisfy 

liminf / l-DufcP > / + rj. 

^ J J 

1 /2 

But then the graph of E^: u must have less mass than and we could hope to modify it 
and gain a comparison current which would contradict the minimality of T^, at least for k 
sufficiently large. 

Recalling Section 8.1 there is a delicate point to address, namely that for multiple valued 
functions a uniform control on the Dirichlet energy of a sequence does not imply directly 
compactness, since the separation between sheets could explode along the sequence. This 
is achieved with Proposition 8.8. Incidentally the analysis above shows also that the 
approximation of Theorem 10.8 is close to a Dir-minimizer, which we record in the following 
theorem. 


Theorem 10.14 (Harmonic approximation). Let 71 he the constant of Theorem 10.8. 
Then, for every 7, <5 > 0, there is a positive constant with the following property. Assume 
that T is as in Theorem 10.8, E := E(T, C 4 r{x)) < ei and rA < If f is the map in 

Theorem 10.8 and we fix suitable coordinates, then there exists a Dir-minimizing function 
u : Br{x) —)■ .4,q(M”) such that w := {u, !&(?/,«)) satisfies 


r 


-2 



G{f,wf + 



{\Df\-\Dw\f + 



\D{r}o f) — D{r}ow)\‘^ < 7] Er^ . (68) 


11. A FIRST ATTEMPT TO PROVE THEOREM 3.4 

In this section we summarize what we have achieved so far and propose a hrst strategy 
to show Theorem 3.4. After resolving the hrst important issues, we will have to face a 
major obstacle: more than half of Almgren’s monograph is in fact dedicated to overcome 
this point and even in the proof given by [26, 27, 30, 28, 29] the same obstacle is responsible 
for roughly one quarter of the combined length, namely paper [28]. 

The strategy to prove Theorem 3.4 starts similarly to the Federer’s reduction argument. 
Assume that there is an area minimizing current T of dimension m > 2, in a sufficiently 
smooth Riemannian manifold S, which has a large singular set Sing(T): more precisely we 
assume that, for some a > 0, 'H™“^’''"(Sing(T)) > 0. 

From Theorem 6.2 and Lemma 6.3 we conclude immediately that at p G 

Sing(T) there is one hat tangent plane and the multiplicity is integral. Let us introduce 
the notation Dq(T) for those points in spt(T) \ spt((9T) where the density of T is the 
positive integer Q. Similarly, we set SingQ(T) := Dq(T) fl Sing(T). We then know that 
Sing^(T) is empty. Indeed the assumptions (a), (b) and (c) in Theorem 4.2 follow from the 
monotonicity formula when p is sufficiently small. The second assumption in (d) is also 
fulhlled: since we can assume that the second fundamental form of S is bounded, for p 
sufficiently small we obviously have pA < e. It would remain to prove that the excess with 
respect to some plane is suitably small at a sufficiently small scale. This is however not 
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difficult since all tangent cones at a point p with 0(T,p) = 1 must be necessarily flat: it 
can be shown that the only area minimizing m-dimensional cones S with US'!!(Bi(0)) = Um 
are m-dimensional planes counted with multiplicity 1. 

We stop for a moment to observe the following interesting consequence of the above 
discussion. Let p be a point in spt(T) where the multiplicity is Q and assume that the 
surrounding points in spt(T) have the same multiplicity at a sufficiently small scale, say 
in Bp(p). Then S := T/Q is a. well defined integer-rectifiable area minimizing current in 
Bp(p) and moreover p G Di(S'). Thus S is regular in a neighborhood of p. We summarize 
the outcome of the latter discussion in the following 

Corollary 11.1. If S is an area minimizing cone with Q{S, 0) = 1, then S is a flat plane 
with multiplicity 1. 

Let T he an area minimizing current in a Riemannian manifold S. If p E Dq(T) 
and there is a neighborhood U of p where the density is Q at ||T||-a.e. point, then p is a 
regular point. 


We next recover our discussion and look at the current T which should contradict The¬ 
orem 3.4. We infer from Corollary 11.1 that there must be an integer Q > 1 such that 
SingQ(T) has positive 'H™'“^’''"-measure. Now, recalling the approach of Federer’s reduction 
argument, we know that for 'H™“^+“-a.e. p G SingQ(T) we have 


lim inf 

r40 


?{S~^+"(Sing(T)nB,(p)) 


> 0 . 


(69) 


Moreover, by Theorem 6.2 we can assume that at least one tangent cone at p is flat. We 
thus have a sequence of rescalings which are converging to a flat plane and a sequence 
of rescalings Tp^r^ for which (by (69)) 

lim n B,(0)) = ., > 0 . (70) 


Of course the sequence does not necessarily coincide (or is comparable to) {r^}. 
However, it can be shown that, w.l.o.g., the two sequences can be assumed to coincide (cf. 
[29] and [62, Section 4]). More precisely 


Proposition 11.2 (cf. [29, Proposition 1.3]). If Theorem 3.4 were false then there would be 
an area minimizing current T in a smooth Riemannian manifold S, a point p G SingQ(T) 
and a seguence of rescalings Tp^r^ converging to a flat plane of multiplicity Q and such that 
(70) holds. 


We will see in a moment the simple idea behind Proposition 11.2. Taking it for granted, 
one could at this point hope to carry the following program: 

(A) We apply Theorem 10.8 to construct a sequence of Lipschitz maps fk whose graphs 
approximate efficiently 

(B) After normalizing suitably fk, we apply Theorem 10.14 and, up to extraction of 
a further subsequence, assume that it converges to a Dir-minimizing multivalued 
map foo] 
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(C) We finally use (70) to show that f^o has a singular set of positive 'H™' measure: 
this would contradict Theorem 8.4. 

(C) is again a “persistence of singularity” statement. Unfortunately it is not difficult to 
see that it is false in this form and thus the rough strategy outlined above must be suitably 
adjusted. After dealing with Proposition 11.2 we will discuss in detail why (C) fails and 
propose therefore a new strategy to prove Theorem 3.4. 

11.1. The existence of a good sequence. The proof of Proposition 11.2 is still a suitable 
modification of Federer’s reduction argument. By the discussion above, we first choose a 
point p G SingQ(T) and a sequence j, 0 where (70) holds. Assume without loss of 
generality that p = 0. If To^n, converges to a flat plane of multiplicity Q we are done. 
Otherwise we can assume that it converges to some tangent cone S, which is singular and 
such that ©(S', 0) = Q. We now wish to show that 'H™“^+“(SingQ(S')) > 0. 

First of all, by the monotonicity formula, 'H™“^'''“(Dq(S')) > 0. Now, if 

WS-"*“(Siiige(S)) = 0 (71) 

then many of the points in Dq(S') should be regular: let us denote by RegQ(S') the set 
of such points. Note that RegQ(S') is relatively open. If S'L(RegQ(S')) has nonempty 
boundary, then the support of such boundary consists of elements in SingQ(S') and it has 
positive 77 ™“^ measure. The latter statement is not compatible with (71) for a < 1. Thus 
S' = S'L(RegQ(S')) has no boundary. So it is an area minimizing cone with multiplicity 
Q at every p G spt(S") \ {0} and with multiplicity no larger than Q at the tip 0 (because 
115" 11 < II S'!!). On the other hand the upper semicontinuity of the density implies that 
0(*S",O) > Q: thus IIAII(Bi(0)) = Qum = ||>S"||(Bi(0)). Hence S and S' must coincide. We 
thus conclude that Corollary 11.1 is applicable to S, which must be flat, contrary to our 
assumption. 

Having found that S is another area minimizing current with large SingQ(S'), we can 
apply the discussion above to some point p G spt(S') \ {0}. We thus hud a sequence Sp^n, 
such that limfc'H™“^+"(SingQ(S'p,rfc)) > 0. As above, Sp^rt can be assumed to be converging 
to some tangent cone Z: if it is flat, we then have achieved the conclusion of Proposition 

11.2. Otherwise 'H™“^+"(SingQ(Z)) > 0 and we can restart with Z in place of S': this 
time, however, Z splits off a line. Iterating this procedure we keep “splitting off” lines, 
until eventually we must reach a sequence as in the statement of Proposition 11.2. 

11.2. Persistency of multiplicity Q points. Having proved Proposition 11.2, we are 
now in the position to carry on our hrst program. Point (A) is obvious and we have to face 
point (B). Let us fix a sequence as in Proposition 11.2 that it is converging to Q [tto] where 
TTo = X {0}. Thus, for a sufficiently large k, Theorem 10.8 applies to fhe cylinder 

©4(0). Let fk be the corresponding approximating maps fk : i7i(0, ttq) —)■ .4,q(M"'). It is not 
obvious, apriori, that we can apply Theorem 10.14, since the excess Ek := E(To,,.j,, ©4(0)) 
might converge to zero too fast compared to r^A. Let us leave this technical problem aside: 
we then could assume that Uk := fk/El^^ is converging to a Dir-minimizing map Uoo- 
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Next, we can intuitively expect that Uoo has many points of multiplicity Q, in particular 
all the ones which are limits of sequences lying in pjro(DQ(To^rj,)). Namely, we expect that 
points in p,ro(DQ(To,ri,)) cluster towards points where Uoo = Q\'n ° This intuition is 
correct. In fact we can first prove the following 

Theorem 11.3 (Persistence of Q-points, cf. [30, Theorem 1.7]). For every 6, C* > 0, there 
zss g]0, |[ such that, for every s < s, there exists i{s,C* ,6) > 0 with the following property. 
IfT is as in Theorem 10.8, E := E(T, C4r{x)) < i, < C*E and 0(T, {p,q)) = Q at 
some {p,q) G 0^/2 (t), then the approximation f of Theorem 10.8 satisfies 



g{f,Qlr]ofjf<8s^r^^^E. 

(p) 


(72) 


For the proof of the latter theorem we refer to [30]. The argument relies on the improved 
monotonicity formula of Theorem 3.11. 

Looking back at our u^o, which is the strong limit of Uk = fk/E]!'^, when p is a 
point in the domain of u^o which is the limit of (the projections onto ttq of) a sequence of 
(Pfc, qk) e spt(Tfc) with ©(T^, [pk, qk)) = Q , we then have 


lim/ GiUoo:Ql'n 0 Uooj)‘^ = 0. 

Since u^o is Dir-minimizing and, hence, continuous, we have Moo(p) = Qlv ° '*^oo(p)]- Now, 
we must have a set of points p with positive jxieasures where this occurs. Since 

Theorem 8.4 tells us that the singular set of Mqo has dimension at most m — 2, the only 
alternative left is that Uoo is a classical harmonic single valued function counted Q times. 
That is, once again the singularities have failed to survive in the limit. If we could exclude 
this disappearence of the singularities, we would have reached a contradiction and hence 
proved Theorem 3.4. 


Let us look of what happens if we apply the analysis above when the current T is 
the holomorphic curve of Example 5.4 in a neighborhood of 0. It is obvious that (in 
complex coordinates) the procedure above will deliver the map ^ 00 ( 2 :) = 2 although 
the currents Tq^^ are singular at the origin, u^o is regular. If we compare our situation with 
the proof of Theorem 8.4 outlined in Section 9, it is quite obvious why we failed to capture 
the singularity in the limit; we have not subtracted the “average of the sheets”, namely 
the regular part of our multiple valued function. The latter has much higher energy than 
the branching singularity, which is a very small perturbation: if we do not normalize our 
approximations in some way, we fail to capture the singular behavior in the limit. 


12. The center manieold: construction algorithm 

Summarizing the discussion of the previous section, we are confronted with the following 
problem. Given a Q-valued Dir-minimizing function u = |Mj] , the average of its sheets, 
namely rj o u := ^ is a classical harmonic function and after subtracting it from u 
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we find a new Dir-minimizing Q-valued function 

u := - T) o uj . 

i 

When u is nontrivial, a point p where u{p) = Q |0] is necessarily singular. Loosely speaking 
we could say that u is the “well-balanced part” of u. If an area minimizing current T 
satishes Assumption 10.7 we would like to have a replacement for the average of the sheets 
r] o u and a replacement for the well-balanced Q-valued map it. 

One possibility would be to apply Theorem 10.8: we then gain a corresponding approxi¬ 
mating Lipschitz map /: the average of its sheets, namely r/o/, and its well-balanced part 
/ are both well dehned. However, we wish to use these objects in a blow-up procedure: 
obviously rj o f and / do not serve our purposes, since / is a good approximation of the 
current only at the scale of a certain given cylinder 

We would rather like to localize the idea above. This is obviously only possible in those 
regions (and those scales) where the current is sufficiently flat. On the other hand we might 
not worry about those portions, or those scales, at which the current is not sufficiently flat: 
in the blow-up procedure we wish to capture the limiting behavior of the current around 
those points belonging to SingQ(T) and we already saw in the previous section, namely in 
the proof of Proposition 11.2, that when a lot of points Dq(T) cluster at a certain scale, 
the current is necessarily rather flat. 

12.1. Preliminary considerations and notation. Localizing the basic idea above is a 
very delicate issue, which involves several parameters. First of all, to £x ideas we will 
assume that our center manifold will be constructed in a cylinder C of size comparable to 
1 (namely the radius will be a hxed geometric constant, certainly larger than 1), centered 
at the origin and with basis parallel to ttq := M™' x {0}. We will assume that in the cylinder 
C the ambient manifold S is very close to be flat: this “almost flatness” is measured in a 
suitable norm. 

Assumption 12.1. Eq g]0, 1] is a hxed constant and S C ^ 7 ^ C 7?™+"- is a (m-|-h)- 
dimensional submanifold with no boundary in We moreover assume that, for each 

p G S, S is the graph of a map Tp : TpS fl —)■ TpS-*-. We denote by c(S) the 

number sup^g^^ ||Zl\hp||(^ 2 ,£o. T° is an m-dimensional integral current of with support 


in S n Bgy;^ and hnite mass. It is area-minimizing and moreover 

0(O,T°) = g and dT°LB6^ = 0, (73) 

l|r”ll(B6v™,)<KQ(6x/Sr + £^)p”' Vp<l, (74) 

E (T°, Bg^) = E (T°, Bg^, TTo) , (75) 

mo := max {c(S)^ E (T°, Bq^) } < ej < I . (76) 


£2 is a positive number whose choice will be specified in each statement. 

In what follows we set I := n — n. To avoid discussing domains of dehnitions it is 
convenient to extend S so that it is an entire graph over all TpS. Moreover we will often 
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need to parametrize S as the graph of a map th : M}. Although we do not assume 

that X {0} is tangent to S at any p, it is clear that, when £2 is sufficiently small, S 

is the graph of a function 'h over with ||Zi)\h||<72,60 < CoiriJ^. 

The next lemma is a standard consequence of the theory of area-minimizing currents. 

Lemma 12.2 (Non-empty vertical slices, cf. [28, Lemma 1.6]). There are positive constants 
Co{m,n,n,Q) and Co{m,n,n,Q) with the following property. If is as in Assumption 
12.1, 82 < Co and T := T'’LB23y^/4, then: 

(9 TLCii^/ 2(0 ) TTo) = 0, (p7ro)tt^LCiiy;^/2(0 ) TTo) = Q |i?110TI/2(O, To)] (77) 

and h(T, tq)) < CoiriJ^"" . (78) 

In particular for each x G 511^/2(0,710) there is a point p G spt(T) with P7ro(p) = x- 

From now we will always work with the current T of Lemma 12.2. We specify next some 
useful notation. For each j G N, denotes the family of closed cubes L of tto of the form 

[ai, ai -h 27] X ... X [a^, Om + 27] x {0} C tto , (79) 

where 27 = =: 2 7(L) is the side-length of the cube, Oj G Vi and we require in 

addition —4 < a* < a* -|- 27 < 4. To avoid cumbersome notation, we will usually drop the 
factor {0} in (79) and treat each cube, its subsets and its points as subsets and elements 
of R"^. Thus, for the center xl of L we will use the notation xl = (oi 4- 7,..., + 7), 

although the precise one is (oi -t- 7,..., + 7, 0,..., 0). Next we set ^ := IJ/gN ^ 

and L are two cubes in ^ with H G L, then we call L an ancestor of H and H a descendant 
of L. When in addition i{L) = 27(77), H is a son of L and L the father of H. 

Definition 12.3. A Whitney decomposition of [—4,4]”^ C ttq consists of a closed set 
r C [—4,4]”^ and a family W satisfying the following properties: 

(wl) r U = [—4,4]™ and F does not intersect any element of W', 

(w2) the interiors of any pair of distinct cubes Li,L 2 G W are disjoint; 

(w3) if Li,L 2 G W have nonempty intersection, then |7(Li) < 7(L2) < 2 7(Li). 

Observe that (wl) - (w3) imply 

sep (r,L) ;= inf{|a; — y\ : x E L,y e T} > 2i{L) for every L eW . (80) 

However, we do not require any inequality of the form sep (F, L) < Ci{L), although this 
would be customary for what is commonly called a Whitney decomposition in the literature. 

We next set up a rehnement procedure: starting with an initial grid of dyadic cubes, 
denoted by with sidelength 2^“^°, we subdivide each of them into 2™ cubes H of 

sidelength 27(77) = 7(L). To this purpose we will need to associate to each cube L a 
corresponding ball B ^ which is centered on a point lying “vertically” over the center of the 
cube. Its dehnition will be specihed later. We next proceed to give a rough outline of the 
rehnement procedure. 

Now, given our starting hypothesis, we know that both the excess and the height in B/, 
are small compared to ttq whenever L belongs to the initial grid of dyadic cubes, which are 
taken with side-length comparable to 1. We wish to keep rehning the cubes 77 as long as 
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the height and the excess in the corresponding balls keep sufficiently small. However, 
the reference plane might tilt as we rehne the scales and we wish to keep track of this. 
For this reason, we denote by ttl a given plane for which E(T, = E(T, B^,, ttl) and we 
dehne the height of T in the ball B^ as h(T, B^) ;= h(T, B^, ttl). 

We then stop the refining procedure at some dyadic cube L if 
(EX) The excess has become too large in B^,; 

(HT) The height has become too large in B^,, although the excess has remained small; 
(NN) The rehnement stopped at some cube J which touches L and has double sidelength, 
although in L itself both the height and the excess would be small enough to keep 
rehning. 

The reader familiar with Whitney’s (or Calderon-Zygmund) decompositions will recognize 
that the latter condition is enforced to guarantee that, after all the steps of the rehnement 
procedure have been carried on, all nearby cubes have comparable sides. Unfortunately 
the conditions (EX) and (HT) taken alone do not guarantee this outcome (the troubles 
are indeed caused by condition (EX)) and the extra (NN) is a source of a few technical 
complications. 

12.2. Parameters. In order to detail the conditions (EX), (HT) and (NN) we need to 
introduce several parameters. 

Assumption 12.4. Ce,Ch, (32,^2 , Mq are positive real numbers and Nq a natural number 
for which we assume always 

r 1 ) 

132 = 4:62 = min 5 -—, r , where 71 is the constant of [30, Theorem 1.4], (81) 

Mf) > Co{m,n,n,Q) > 4 and < 1 . (82) 

As we can see, (62 and 62 are hxed. The other parameters are not hxed but are subject 
to further restrictions in the various statements, respecting the following “hierarchy”. As 
already mentioned, “geometric constants” are assumed to depend only upon m, n, n and 
Q. The dependence of other constants upon the various parameters Pi will be highlighted 
using the notation C = C(pi,p 2 , • • •)• 

Assumption 12.5 (Hierarchy of the parameters). In all the statements of the paper 

(a) Mo is larger than a geometric constant (cf. (82)) or larger than a costant C{52)'^ 

(b) Nq is larger than C{l 32 , 62 , Mq); 

(c) Ce is larger than C(/ 92 , ^ 2 , Mq, No); 

(d) Ch is larger than C{/32, 62, Mq, Nq, Ce); 

(e) €2 is smaller than c(/ 52 , ^ 2 , Mq, Nq, Ce, Ch) (which will always be positive). 

The functions C and c will vary in the various statements: the hierarchy above guarantees 
however that there is a choice of the parameters for which all the restrictions required in 
the statements of the next propositions are simultaneously satished. In fact it is such a 
choice which is then made in [29]. To simplify our exposition, for smallness conditions on 
62 as in (e) we will use the sentence “£2 is sufficiently small”. 
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12.3. The Whitney decomposition. Thanks to Lemma 12.2, for every L G we may 

choose hl G so that pl '■= G spt(T) (recall that xl is the center of L). yi 

is in general not nniqne and we fix an arbitrary choice. A more correct notation for pi 
would be xl + Vl- This would however become rather cumbersome later, when we deal 
with various decompositions of the ambient space in triples of orthogonal planes. We thus 
abuse the notation slightly in using (x, y) instead of x + y and, consistently, ttq x instead 
of TTo +7r^. 

Definition 12.6 (Rehning procedure, cf. [28, Dehnition 1.10]). ] For L G we set 
ri := Moi/m£(L) and := B 64 r^(pL). We next dehne the families of cubes ^ C ^ and 
W = with the convention that = y = W and 

n for □ = h,n,e. We dehne = 0 for i < Nq. We proceed with 

j > A"o inductively: if no ancestor of L G is in W, then 

(EX) L G WJ if E(T, Bi) > C'emo£(L)2-2-52; 

(HT) L G if Wi and h(T, B^) > 

(NN) L G Wl if L ^ Wj U Wfl but it intersects an element of 

if none of the above occurs, then L G y^. We hnally set 

r-|-4,4r\ U n U (®3) 

LS/T i>No LsjXj 

Observe that, if j > A"o and L G y^ U then necessarily its father belongs to y^~^. 

Proposition 12.7 (Whitney decomposition, cf. [28, Proposition 1.11]). Lef Assumptions 
12.1 and 12-4 hold and let 62 he suffieiently small. Then (T, is a Whitney deeomposition 
of [—4,4]™' C TTo- Moreover, for any choiee of Mq and Nq, there is C* := C*{Mq, Nq) sueh 
that, if Ce > C* and Ch > C*Ce, then 

= 0 for all] < No+ 6 . (84) 

Moreover, the following estimates hold with C = C{f 42 , S 2 , Mq, Nq, Ce, Ch): 

E(T,Bj) < Oemo£(J)2-2^2 h{T,Bj) < Chmli"’"i{J)^+h2 yJ G ^, (85) 

E(T,Bl) < Omo£(L)2-2^2 and h{T,BL) < C VL G . (86) 

12.4. Interpolating functions. What do we expect from the decomposition above? Fix 
a point T G [—1,1]™. Only two situations might occur: 

(NS) X does not belong to any cube where the rehnement procedure stopped; 

(S) X E L for some cube L where the rehnement procedure stopped. 

Let r be the set of points as in (NS). If the stopping conditions (EX) and (HT) are 
sufficiently severe, we can expect that TL(r x tt;]-) is a Q multiple of a portion of a (rather 
smooth) single valued graph. 

Consider next a point a; G L as in (S). Although we stopped the rehnement at L, at a 
slightly larger scale we still have a very small excess and a very small height: both E(T, B^) 
and h(T, B^) are still rather small. We thus can hope to apply the approximation Theorem 
10.8 in a suitable cylinder C 32 r^(pi,, ttl): we can then construct a good Lipschitz Q-valued 
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approximation t Aqi^nj;), which will be called the til- approximation in 

B^. Finally we take its average t] o Hl and smooth it by convolution with a kernel (which 
we take to be radial, although the importance of this assumption will become clear only 
in the next section). 

In order to carry on our program we £x two important functions -d, g : —)■ M. 

Assumption 12.8. g G C^{Bi) is radial, f g = 1 and f \x\‘^g{x) dx = 0 . For A > 0 
denotes, as usual, x i-A \~'^g{j). -d G C“([—[ 0 , 1 ]) is identically 1 on [— 1 , 1 ]^. 

g will be used as convolution kernel for smoothing maps 2 : dehned on m-dimensional 
planes tt of In particular, having fixed an isometry A of tt onto the smoothing 

will be given by [{z o A) * ^ 7 ] o A~^. Observe that since g is radial, our map does not depend 
on the choice of the isometry and we will therefore use the shorthand notation z * g. 

Definition 12.9 (vr- approximations, cf [28, Dehnition 1.13]). Let L & y VJ W and tt be 
an m-dimensional plane. If TLC 32 r^(pL, tt) fulhlls the assumptions of [30, Theorem 1.4] in 
the cylinder C 32 ri(pL, tt), then the resulting map / ; Bs^^ipLy) —t AQ( 7 r-*-) given by [30, 
Theorem 1.4] is a n-approximation of T in Csr^ipry)- The map h : Bjr^{pL,T^) —t 
given by h ■.= ij] o f) * will be called the smoothed average of the n-approximation, 
where we recall the notation 770/(3;) := Q~^ Yl^=i Q-valued map / = |/j]. 

12.5. Gluing the interpolations. For each L E y U W we let d'L be an optimal plane 
in Bl, namely so that F,(T,'Bl,71l) = E(T, B^) and h(T, B^) = h(T,'BL,7iL), and choose 
an m-plane ttl C which minimizes Ittl — 7^l\- 

We wish to patch the graphs of the various tilted interpolating functions h^ in a single 
submanifold (and we also hope that this submanifold will glue smoothly with the portion of 
the current lying over F!). However, since the graphs of hi are relative to different systems 
of coordinates, we need to parametrize them as graphs in a common coordinate system. 
Given condition (EX), we can hope that along the rehnement procedure the planes ttl did 
not tilt much. If this is the case the graph of each h^ can be described by the graph of 
some in the “original coordinates” tto x tt^. We could then glue the various ql together 
using a partition of unity. 

Lemma 12.10 (Cf. [28, Lemma 1.15]). Let the assumptions of Proposition 12.7 hold 
and assume > C* and Ch > C*Ce (where G* is the constant of Proposition 12.7). 
For any choice of the other parameters, if 62 is sufficiently small, then TLC 32 ri(pL, t^l) 
satisfies the assumptions of [30, Theorem 1.4] for any L E W U y. Moreover, if fi is a 
TTL-approximation, denote by h^ its smoothed average and by hi the map PTp^E(hi,), which 
takes value in the plane xl ■= TpfP Anf, i.e. the orthogonal complement of til in TpfP. 
If we let hi be the map x i-A Ll^x) := {hL{x),^p^{x,hL{x))) E xl x Tp^T,-^, then there is 
a smooth map gi ■ i?4r^(pi,, tto) —)■ Tif such that Gg^ = G/i^LC4r^(p2,,7ro). 

We are now ready to finally specify how we patch the interpolating functions. The center 
manifold will then be a graph which is obtained by a suitable limiting process. 
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Definition 12.11 (Interpolating functions, cf. [28, Definition 1.16]). The maps hi and 
ql in Lemma 12.10 will be called, respectively, the tilted L-interpolating function and the 


L-interpolating function. 

Set 


For each j let := 5^^ U lJi=Aro ^ define 


(pj := 




on ] -4,4[”^, 


(87) 


let g)j{y) be the first n components of (pj{y) and define g)j{y) = ( 99 ^( 1 /), ^(i/, 99 ^( 1 /))). (pj 
will be called the glued interpolation at the step j. 


Theorem 12.12 (Existence of the center manifold, cf. [28, Theorem 1.17]). Assume that 
the hypotheses of Lemma 12.10 hold and let k := min{eo/2,/92/4}. For any choice of the 
other parameters, if 62 is sufficiently small, then 

(i) WDipjWc'^,^ < Cml'^ and Hv^jUco < , with C = C{/ 32 , S 2 , Mq, Nq, Ce, Ch)■ 

(ii) if L E and H is a cube concentric to L with i{H) = |7(L), then <pj = (pk on H 
for any j,k > i + 2. 

(in) ipj converges in to a map (p and M. := Gr((^|]_ 4 ^ 4 [m) is a submanifold o/S. 

Definition 12.13 (Center manifold and contact set cf. [28, Definition 1.18]). The manifold 
A4 in Theorem 12.12 is called a center manifold ofT relative to ttq and (F, W) the Whitney 
decomposition associated to M. Setting ^(y) := (y,<p(y)), we call ^(F) the contact set. 


13. The estimate for the center manieold 


Theorem 12.12 is probably the most complicated part of the proof of Theorem 3.4. 
However it is important to notice that Theorem 12.12 alone does not encode the full 
strength of the construction described above: we will see in the next section that the 
graph of (p is indeed a very good substitute for the “average of the sheets of a Q-valued 
graph”. For instance, the algorithm can be applied even under the assumption that the 
density of T equals Q ||T||-almost everywhere in C; in this case the refinement procedure 
never stops, F equals [—1,1]™ and finally spt(T) n [—1,1]™ x C Gr((^). Namely, by the 
constancy theorem TL([— 1 , 1 ]™" x tt^) = QG^ and thus we have gained two more derivative 
in the conclusion of Theorem 4.2. 

The latter surprising conclusion, namely that with a “purely geometric construction” it 
is possible to improve the classical e-regularity theorem by 2 derivatives, is already observed 
in the introduction of Almgren’s monograph. It is however possible to find a rather fast 
shortcut to this conclusion since multiple valued functions are not needed. A self-contained 
“elementary” proof has in fact been given in [25]. The latter reference contains, in a highly 
simplified setting, the most important estimates which hide behind Theorem 12.12. In this 
paragraph we will give a rough idea of the regularity of (p. 

13.1. Propagation of the estimates along the scales. It is obvious that in order 
to show Theorem 12.12 we need to prove a uniform estimate on for any L G W. 
This alone will not suffice: as it is simple to conclude from the usual properties of a generic 
partition of unity, we also need to show that, when L and H are two neighboring cubes 
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in then \\D\gL — gH)\\c^ < , for every I G {0,1,2,3}. However, leaving 

aside the “interaction” between nearby cubes, let us focus on ||5 'l||c3 and, to simplify the 
matter even further, let us in fact look at ||/il||(73 . It is rather plausible that if we could 
prove a uniform bound on IlhiUcs for the tilted interpolating functions this will not be 
destroyed by the change to the coordinates ttq x tt,}. 

Let us therefore fix L E W and let L = C Lj+i C ... C L^Vo be a chain of dyadic cubes 
where Lj_i is the father of Lj for every j. Now, if we ignore the Riemannian manifold S 
and assume that it coincides with is the convolution of a Lipschitz function 

at a scale which is obviously comparable with 1 (since i{L]\[o) = and Nq is a hxed 

“constant”, although rather large). Thus \\D’‘hL^^\\co is in fact bounded a-priori with a 
constant depending only on k (and on all the other parameters of the construction). We 
next want to study how the norm \\D^hL^\\co might increase compared to \\D^hL^_^\\co: the 
hope is that this can be bounded by some power of i{Lj), leading in turn to a convergent 
geometric series when k G {0,1, 2, 3}. This would then give a uniform bound on ||/il||(73 . 

Ideally we would like to compute \\D^{hL. — This is however not really 

possible, since the two functions are dehned according to two different coordinate systems 
(namely x and x Let us however assume, for the sake of argument, 

that TiLj = TJ'Lj-i =: Moreover, to simplify the notation let us denote Lj by J and 

by H. 

Under this assumption, hj is the convolution of rjo fj and hn the convolution of rjo f^. 
We know that both fn and /j approximate very accurately the area minimizing current T, 
at two scales which are comparable by a factor 2. Thus, for both graphs Gfj and the 
hrst variation is close to 0, which in turn, given the smallness of the excess of the current 
at that scale, should imply that both rj o f j and t) o are almost harmonic. 

The latter discussion is correct but must be quantihed. It is not difficult to see that it 
can be translated into an estimate for A(77 o /j) and A(t 7 o f^j) in some negative Sobolev 
space (more precisely we use the W~^’^ norm, cf. [28, Proposition 5.1]). To simplify the 
matter even further, let us assume that both rj o and rj o fj are in fact harmonic. 
The regularization by convolution will then not change the two functions, because the 
convolution kernel is radial. Thus we would have ||Zl^(hiii-—/ij )||(70 = ||T)^(t 7 o/ji^— r/o/j)||(^o. 
On the other hand, again by the mean-value formula for harmonic function, we could 
estimate 

\\D^{hH - hj)\\co < ofH-r]o /jH^i . (88 ) 

Let us however recall that the graph of fn coincides with the current T, except for a set of 
measure Gf'(iL)™'E(T, In turn the excess E(T, Bj^) is of size £(iL)^“^^L Since an 

analogous consideration holds for J in place of LT, we conclude that the two maps fn and 
fj coincides except for a set of measure at most £(iL)™+(2-25i)(i+7i)_ Qn the other hand 
the “heights of both maps” is of order i.e. the available estimate for the heights 

h(T, B//) and h(T, B^). Combining these observations, we conclude that 

||r;o/H-77o/j|Ui <G£(/7)”^+3+._ 
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Inserting the latter ineqnality in (88) we would then conclude 

\\D\hH-hJ)\\c.<C^{Hf+^-\ 

13.2. The key PDE estimate. In order to carry on rigorously the program outlined 
in the previous paragraph, rather than comparing directly huj and hj, we introduce an 
intermediate map constructed in the reference cube L relative to a system of coordinates 
which is suitable for anbother J. The following proposition is the technical tool which 
ensures the existence of the relevant maps. 

Proposition 13.1 (Existence of interpolating functions, cf. [28, Proposition 4.2]). Assume 
that the hypotheses of Assumptions 12.1 and 12. f hold, that > C* and Ch > C*Ce, where 
C*{Mq, Nq) is a suitable constant. The following facts are true provided €2 is sufficiently 
small. Let H,L he such that either H<zL or and < ^{H) < i{L). 

Then, 

(i) for 71 = 7TH,7rH, (P7r)tt^LC32ri(pL,7r) = Q (Pl, tt))] and T satisfies the as¬ 

sumptions 0 / [30, Theorem 1.4] in the cylinder Cz 2 rL{pLiT^); 

(ii) Let fuL be the tth- approximation ofT in Csr^iPL, t^h) and h := {rjo f^if) * he 
its smoothed average. Set mh ■= Tif flTp^S and consider the maps 

X H- h{x) ;= VTp^T.{h) _ e xh 

X H- hHL{x) -.= {hlx),'^pij{x,h{x))) e xh X (Tp^(S))-^ . 

Then there is a smooth gni '■ Biri^{pL-,T^o) —t s.t. LC 4 ,.^(Pl, tto). 

Definition 13.2. hui and qhl will be called, respectively, tilted {H, L)-interpolating func¬ 
tion and {H, L)-interpolating function. 

Observe that the tilted (L, L)-interpolating function and the (L, L)-interpolating func¬ 
tion correspond to the tilted L-interpolating function and to the L-interpolating function 
of Dehnition 12.11. Obviously, Lemma 12.10 is just a particular case of Proposition 13.1. 
We are now ready to detail the main analytical reason behind the estimates claimed in 
the previous paragraph: the point is that the vanishing of the hrst variation of T can be 
translated into a suitable elliptic system for the interpolating functions. However, since 
the current T is constrained to belong to a given Riemannian submanifold S, the elliptic 
system necessarily involves only some components of the maps 

Definition 13.3 (Tangential parts, cf. [28, Dehnition 5.1]). Having hxed H G and 
TT := tth C TpjjT,, we let x be the orthogonal complement of tt in Tp^T,. For any given 
point q G any set 12 C tt and any map f : q -\- Q ^ 71 -^, the map Px ° ■C will be called 

the tangential part of f and usually denoted by f,. Analogous notation and terminology 
will be used for multiple-valued maps. 

Proposition 13.4 (Elliptic system, cf. [28, Theorem 5.2]). Assume 82 is sufficiently small 
and the assumptions of Proposition 13.1 hold. Let H G and L be either an ancestor 

or a cube of fl with H n L ^ ij) (possibly also H itself). Let fni ■ Bsr^^ipL, t^h) —t 
the 71H-approximation ofT in C8r^(pL, T/f), hni the tilted {H, L)-interpolating 
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function and fuL and Jihl their tangential parts, according to Definition 13.3. Then, there 
is a matrix L, which depends on S and H but not on L, such that |L| < CqA? < CoTHq 
for a geometric constant Co and (for C = C(/92, ^ 2 , Md, Nq, Ce, Ch)) 




(rLllCllci + IICIIco) 


{Diri ° fur) : - Ph)Y ■ L ■ C) 

for every C e Cf^iBgr^iPi, ttl), x). Moreover (for C = C{l32, <^ 2 , Mq, Nq, Ce, Ch)) 

\\hHL -r]o /hl||li(B 7 < Cmor™+^+^^. 


(89) 

(90) 


Observe that, if we ignor the effect of the ambient Riemannian manifold, (89) is an 
estimate on the W~^’^ norm of A{r] o fni)- 


14. The approximation on the normal bundle of the center manifold 


To carry on our program for proving Theorem 3.4 by “blow-up”, we now need to ap¬ 
proximate again our area minimizing current in a cylinder where the excess is small, taking 
advantage of the center manifold. Let Ai = Gr((^) be the center manifold constructed in 
the previous sections and let us make some first considerations. 

First of all, by the C^’'^ estimates, we know that in a sufficiently small neighborhood U of 
Ai there is a C^’^ orthogonal projection p ; U ^ A4 which to each p G U assigns the unique 
point q = p(p) such that p — g is normal to TqAi. In fact, since ||Zl(p||c 2 ,K < and mo 

can be chosen arbitrarily small, the “thickness” of U can be assumed to be of any given 
size, say 1. In turn, since the height of T in C is of order raYA"^, we can certainly assume 
that spt(TLCi/2(0)) C U. It is also not difficult to see that T is a Q-fold covering of Ai, 
namely (ptt(TLCi/2)) LC1/4 = Q [Ai (1 C1/4] (we need to restrict slightly the radius of the 
cylinder to avoid “boundary effects”). We could dehne a “curvilinear excess” compared to 
Ai with the following procedure: at each point p G spt(T) we measure the distance between 
T(p) and the oriented tangent plane to Ai at the projection p(p). We then integrate the 
square of this quantity over spt(T) fl C 1 / 2 . The corresponding formula is 



|f(p)-Tp(p;Afpd||T||(p). 


It is no surprise that the latter is controlled by the “straight excess” in the cylinder C, 
simply because the tilt between T^Ai and ttq is controlled by ||Zi)(p||( 70 , for which in turn 
we have the bound Cmf . Thus, as it happens with the “straight excess” we can expect to 
be able to approximate T efficiently with a multivalued map N defined on Ai and taking 
values in the normal bundle of Ai. 


14.1. Preliminaries. In order to state the corresponding theorem precisely we need how¬ 
ever to introduce some suitable objects. 

Assumption 14.1. We £x the following notation and assumptions. 

(U) U := {t G : 3! p = p(a;) G Ai with |a; — p| < 1 and {x — y) A Ai}. 

(P) p ; U —)■ AI is the map defined by (U). 
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(R) For any choice of the other parameters, we assume 62 to be so small that p extends 
to and p“^(|/) = y + -Bi(0, (TyM)-^) for every y G M.. 

(L) We denote by (9;U ;= p“^((9Wl) the lateral boundary of U. 

The following is then a corollary of Theorem 12.12 and the construction algorithm, which 
summarizes the discussion of the previous paragraph. 

Corollary 14.2. Under the hypotheses of Theorem 12.12 and of Assumption If.l we have: 

(i) spt((9(TLU)) C (9/U, spt(TL[—|, x R”) c U and pjj(TLU) = Q |A1]; 

(ii) spt((T, p, $(g))) C { 1 / : \^{q)—y\ < j for every q E L e W, where 

C = C{/32,S2,Mo,No,Ce,Ch); 

(hi) (T, p,p) = Q Ip] for every p G $(r). 

We can now introduce multivalued maps on the normal bundle of AA. 

Definition 14.3 (Al-normal approximation, cf. [28, Dehnition 2.3]). An Ai-normal ap¬ 
proximation of T is given by a pair (/C, F) such that 

(Al) F : Ad —)■ AIq(U) is Lipschitz (with respect to the geodesic distance on M.) and 
takes the special form F{x) = Yhi + -^*( 2 ^)]) with Ni[x) T T^M. and x + Ni{x) G 
S for every x and i. 

(A 2 ) /C C Ad is closed, contains #(r fl [—|, |]"^) and TirLp“^(/C) = TLp~^(/C). 

The map N = 1-^*1 • AIq(R™^”) is the normal part of F. 

14.2. Main approximation theorem. In the dehnition above it is not required that the 
map F approximates efficiently the current outside the set 4>(r fl [—|, |]™). However, we 
need a map which approximates T with a high degree of accuracy. Moreover, the accuracy 
must be coupled with the local information coming from the Whitney decomposition. For 
this region we introduce the ‘Whitney regions” on the center manifold Ad. 

Definition 14.4 (Whitney regions, cf. [28, Dehnition 1.18]). To each L G we associate 
a Whitney region £ on Ad as follows: 

(WR) £ ;= 4>(id n [—|, l]^), where H is the cube concentric to L with i{H) = ^i{L). 

Theorem 14.5 (Local estimates for the Ad-normal approximation, cf. [28, Theorem 
2.4]). Let 72 := with 71 the constant of [30, Theorem 1.4]. Under the hypotheses of 
Theorem 12.12 and Assumption If.l, if 62 is suitably small (depending upon all other 
parameters), then there is an Ai-normal approximation (/C, F) such that the following 
estimates hold on every Whitney region £ associated to a cube L G W, with constants 

C = C{(32,52,Mo,No,Ce,CH): 

Lip(Ar|^) < CmlH{Ly^ and \\N\c\\co < (91) 

|£ \ /C| + ||Ti. - T||(p-^(£)) < C'mo+^^£(£)”^+2+72^ (92) 

y \DN\^ < . (93) 
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Moreover, for any a > 0 and any Borel V C C, we have (for C = C(/92, <^ 2 , Mq, Nq, Cg, Ch)) 
[ \rioN\<Cmo{i{Lr+^+^^^^ + ai{Lf+^^/^\V\) +- [ g{N,Q {t] o (94) 


Before proceeding, we wish to discuss why the estimate (94) is of crucial importance. 
Our plan is to show that N is close to a Dir-minimizing Q-valued map and that a large 
singular set for T induces a large singular set on the latter map. The first point was, in 
“straight coordinates”, an effect of the Taylor expansion of the area functional of a graph. 
Thus, it makes sense to compare the mass of Ti? with that of Q |A^]. 


Theorem 14.6 (Expansion of M(Ti?), cf. [27, Theorem 3.2]). If Ai, F and N are as 
above and the Lipsehitz eonstant of N is suffieiently small, then 

M{Tf) = QH"^{M)-Q f {H,r]oN) + l- [ \DN\‘^ + R.O.T., (95) 

Jm ^ Jm 

where H is the mean eurvature veetor of M and H.O.T. eontains higher order terms, 
namely expressions that are at least trilinear in N. 


Notice in particular that there is a nonvanishing linear term in the expansion. In order 
to show that N is quasi-harmonic, we therefore need {H, rf o N) to be much smaller than 
\DN\\ 

Before explaining the ideas behind the proof of Theorem 14.5 we wish to remark that 
from (91) - (93) it is not difficult to infer analogous “global versions” of the estimates. 

Corollary 14.7 (Global estimates, Cf. [28, Corollary 2.5]). Let A4' be the domain 
$([—1,|]™) and N the map of Theorem If.5. Then, for C = C{/32,S2, Mq, NQ,Ce,Ch), 

Lip {N\M') < Cm(f and ||iV|;K/||c 70 < (96) 

\M' \ /C| + ||T^ - T||(p-i(M')) < (97) 



\DN\^ < Cmo. 


(98) 


14.3. A sketch of the proof. Recall hrst that (T, p,p) = Q |p] whenever p G ^*(r) and 
thus we set N{p) = Q |0] for any such p: it remains therefore to dehne N on each region 
of the form $(T) when L G W. 

Let us £x L G W. Going back to the construction of the center manifold, we discover 
that we already have a rather accurate graphical approximation of T in the region of our 
interest, since we already considered the til approximation fi. If fL were a classical single 
valued function, we could simply parametrize its graph on the normal bundle of A4. Indeed 
the tangent planes to the graph of fi are certainly close to til and on the other hand A4 
is constructed patching a suitable smoothing of the average t] o fi- Thus, in the cylinder 
C6ri(PL, ttl), the angle between a generic tangent plane to the graph of fi and a generic 
tangent plane to Ai is rather small. In the Q-valued setting “reparametrizing” graphs is 
a much more subtle issue than in the classical single valued setting . However it is not 
very hard to prove a suitable theorem (see [27, Theorem 5.1]) which allows to describe the 
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graph of fi through a Lipschitz map on the normal bundle of Ai. Note moreover that 
Nl can be dehned on a much larger domain than ^{L). 

We next have to face a new difficulty: if L and H are two nearby cubes, the maps Nl 
and Nh do not necessarily agree on the intersection of their domains. However, recall that 
the graphs of fi and fn coincide with the current T except for two sets of small measure. 
Thus the values of Nl and N^ coincide on a very large portion of the set on which both are 
dehned. In turn, the construction algorithm ensures that each H intersects only a hnite 
number of other cubes in W\ such number is bounded a-priori by a geometric constant 
(for instance, when m = 2 each square of 'W can intersect at most 12 other squares of W). 
So, after removing from each $(T) all those points where Nl does not coincide with all 
the Nh related to neighboring cubes, we reach a uniquely dehned map iV on a rather large 
subset of \ $(r). 

We now wish to extend this map to a Lipschitz one dehned on all Ai. It is not difficult to 
see that N is already globally Lipschitz on its domain of dehnition and that it approaches 
the value Q |0] on sequences converging to $(r): this is because the height of /l (and thus 
that of Nl) is controlled by and cubes in W close to F necessarily have small 

sidelength. However, it does not serve our purposes to give a global Lipschitz extension 
of N which does not respect the local properties of the map. In particular we desire an 
extension that on each Whitney region C has small Lipschitz constant: the smaller the 
scale, the smaller should be the Lipschitz constant. To achieve this property we follow 
an elementary idea, which we next describe in the special case of dimension m = 2. As a 
matter of fact, since # is Lipschitz we can, for the sake of our discussion, assume that ^{L) 
is flat and coincides with the square L. We £x the four points A, B, C and D which are the 
four vertices of the square. We first wish to extend N to four small neighborhoods of these 
points (if N is not already defined there); we will call the neighborhoods U{A), U{B), U{C) 
and U{D) and we will fix them to be disks centered at the respective points with radius 
£(L)/4. 

Take for instance A: the latter point is a common vertex for four squares Hi, H 2 , L 
of W. We then consider the intersection of the domain of N with Hi U H 2 U U L: if 
we restrict the map N to this set and consider its Lipschitz constant, we can then use 
the Lipschitz extension theorem for multivalued functions to extend it to a neighborhood 
U{A) of A, without increasing such constant by much. We proceed and extend the map 
separately to neighborhoods of A, B, C and D. However when we extend the map to 
the neighborhood U{B) we disregard what we did in the neighborhood of A and we only 
take into consideration the “original” N: having chosen such neighborhoods sufficiently 
small the distance between two points p and q which belong to distinct neighborhoods 
(say U{A) and U{B)) is larger than f'(L)/2 and the height of the extension is no larger 
than thus providing automatically a good Lipschitz bound on U{A) U U{B). 

This procedure can then be repeated for all squares and we have a new map N' which is 
evidently defined in a neighborhood of any vertex of any L G W. 
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With the same principle we extend N' to neighborhoods U{AB), U{BC), U{CD), U{DA) 
of the corresponding sides of the square L. This seems more problematic because, for 
instance, U{AB) and U{BC) intersect in a neighborhood of B. However N' is already 
dehned on U{A) U U{B) U U{C) U U{D). Thus we need to extend it to the sets V{AB) = 
U{AB) \ {U{A) U U{B)) and V{BC) = U{BC) \ {U{B) U U{C)). The latter can be now 
assumed to be disjoint and separated by a distance of the order coi{L) for some cq > 0: 
it just suffices to choose the thickness of the neighborhoods U{AB) and U{BC) much 
smaller than the thickness of the neighborhoods U{A), U{B) and U{C). We can then 
literally argue as above and gain a second extension of the map, say N", which is defined 
on the boundary of any square L of W. At this point the third (and hnal) extension is 
achieved by considering each square separately. 

15. Blow-up I: preliminary considerations and frequency function 

We are now ready to discuss the proof of Theorem 3.4. We assume by contradiction that 
the theorem is false and, recalling Proposition 11.2, we fix an area minimizing current T 
of dimension m, a plane ttq (which without loss of generality we assume to be MD x { 0 }) 
an integer Q > 0 and a sequence of radii j, 0 with the following properties: 

• The excess E(To,rfc, C 8 ( 7 ro, 0)) converges to 0 as /c t oo; 

• The point 0 is singular, 0(T, 0) = Q and fl SingQ(To,,.j^)) > 17 > 0. 

We spend the next paragraph to make our discussion above precise. In particular from 
now on we will assume the following hypotheses: 

Assumption 15.1. Let Sq ^]0, 1[, m,h G N \ {0} and I G N. We denote by 

(M) S C an embedded (m-|-h)-dimensional submanifold of class 

(C) T an integral current of dimension m with compact support spt(T) C S, area 
minimizing in S. 

Assumption 15.2 (Contradiction). There exist m > 2,n,/, S and T as in Assumption 
15.1 such that 'H™“^+“(Sing(T)) > 0 for some a > 0. 

Moreover, let us recall the following subsets of Reg(T) and Sing(T), introduced a few 
sections ago. 

Definition 15.3 (Q-points). For Q G N, we denote by Dg(T) the points of density Q of 
the current T, and set 

RegQ(T) := Reg(T) n Dq(T) and SingQ(T) := Sing(T) n Dq(T). 

The hrst key player in the proof of Theorem 3.4 is then the following sequence of rescal¬ 
ings of the current T in Assumption 15.2. 

Proposition 15.4 (Contradiction sequence, cf. [29, Proposition 1.3]). Under Assumption 
15.2, there are m,n,Q >2, Yi and T as in Assumption 15.1, reals a, 77 > 0, and a sequence 
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Tfc 0 such that 0 G Dq(T) and the following holds: 

lim E(To^rfc, Bg^) = 0, (99) 

fc^-l-oo 

lim n Bi) > ,,, (100) 

fc^ + OD 

n spt(To,.J) \ Dg(To,.J) >0 ykeK (101) 

In particular, by Proposition 15.4 and simple rescaling arguments, we assume in the 
sequel the following. 

Assumption 15.5. Let £3 G] 0 ,e 2 [- Under Assumption 15.2, there exist m,n,Q > 2, 
a,!] > 0, T and S for which: 

(a) there is a sequence of radii 0 as in Proposition 15.4; 

(b) the following holds: 

ToS = X {0}, spt(9T)nB6^ = 0, 0 e Dq{T), (102) 

||T|| (Bg^,) < r- (g + 4 ) for all r G (0,1), (103) 

c(S n By^) < £3 . (104) 

15.1. A sequence of center manifolds. We wish to approximate the current with an 
A4-normal approximation N over a center manifold Ai. A first attempt could be the 
following: for some r sufficiently large, the excess E(To,r, Cr(7ro, 0)) will be sufficiently 
small and So,r := <-o,r(U) will be sufficiently flat. Assuming without loss of generality that 
r = 1, we can then construct a center manifold Ai and an Al-normal approximation. 
However, we have no guarantee that this approximation is accurate at very small scales 
around 0. This would certainly be the case if 0 belonged to the contact set P of Dehnition 
14.4, but of course it might be the case that 0 belongs to some cube L E W where the 
rehning procedure has stopped. If this is the case at a certain small scale around 0 the 
graph of N might have a completely different behavior than T. 

If we set ti := 1, we can then distinguish two situations. In the first one 0 G P and we 
are thus satisfied with our center manifold and our approximating map, which we denote 
by Ail and Ni. In the second one at some distance si from the origin we encounter the 
hrst cube of size sufficiently large compared to si, say c^si for some specihed constant 
Cs (this number turns out to be l/(16i/m) in our proof, cf. [29, Section 2]: its precise 
value is however not important, as long as it is a hxed parameter which does not depend 
upon those which enter the construction of the center manifold). We can assume that Si 
is (much) smaller than ti\ we just need to set the parameter Nq in the construction of the 
center manifold accordingly large. 

At the scale Si the two objects Aii and Ai, i.e. the center manifold and the corre¬ 
sponding approximation, are not anymore serving our purposes. Now consider the 

latter current might or might not have sufficiently small excess to construct another center 
manifold. In the first case we set t 2 = -Si, otherwise we let t 2 be the first radius smaller 
than Si at which the excess goes below the desired threshold. We then construct the pair 
Ai 2 and N 2 taking reference area minimizing current. 



64 


CAMILLO DE LELLIS 


We formalize the above discussion in the following way. We set 

:= {r e]0,1] : E(T, . (105) 

Observe that, if C TZ and Sk t -s, then s eTZ. We cover IZ with a collection T = {Ij}j 
of intervals Ij =]sj, tj] dehned as follows, to := max{f : t G TZ}. Next assume, by induction, 
to have dehned tj (and hence also to > -So > ti > Si > ... > > tj) and consider the 

following objects: 

- Tj := ((to,o)tt^)LB 6 ^, Tj := io,tj{T) fl By^] moreover, consider for each j an 
orthonormal system of coordinates so that, if we denote by ttq the m-plane R™' x 
{ 0 }, then E(Tj,B6^,7ro) = B{Tj,Bo^) (alternatively we can keep the system of 
coordinates hxed and rotate the currents Tj). 

- Let Aij be the corresponding center manifold constructed in Theorem 12.12 applied 
to Tj and Tj with respect to the m-plane ttq; the manifold M.j is then the graph of 
a map : ttq D [—4,4]”^ —)■ TT^f, and we set ^j{x) := (x, (Pj(x)) G ttq x 

Then, we consider the Whitney decomposition of [—4,4]”^ C ttq as in Dehnition 12.6 
(applied to Tj) and we dehne 

Sj := max (^{c~^£(L) : L G and c~^l{L) > dist(0,L)} U {0}) . (106) 

It can be shown, see below, that Sj/tj < 2“®. In particular this ensures that [sj,tj\ is a 
(nontrivial) interval. Next, if Sj = 0 we stop the induction. Otherwise we let fj+i be the 
largest element in 7^n]0,Sj] and proceed as above. Note moreover the following simple 
consequence of (106): 

(Stop) If Sj > 0 and f := Sj/tj., then there is L G vvith 

i{L) = Cgf and L fl 5^(0, ttq) 7 ^ 0 (107) 

(in what follows Br{p, n) and Br{p, n) will denote the open and closed disks Br(p) 0 

(p + tt), B^(p) n (p + tt)); 

(Go) If p > f := Sj/tjj then 

i{L) < CsP for all L G with L 0 5^(0, ttq) 7 ^ 0. (108) 

In particular the latter inequality is true for every p g]0, 3] if Sj = 0. 

The following is a list of easy consequences of the dehnition. Given two sets A and B, we 
dehne their separation as the number sep( 74 ,i?) := inf{|a; — y\ : x E A,y E B}. Moreover 
we will use the notation Br{p) for the geodesic balls in Ai. 

Proposition 15.6. Assuming £3 sujficiently small, then the following holds: 

(i) Sj < ^ and the family ZF is either countable and tj f 0, or finite and Ij =]0,fj] for 
the largest j; 

(ii) the union of the intervals of T cover TZ, and for k large enough the radii r^ in 
Assumption 15.5 belong to TZ; 
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(iii) if r g]|^,3[ and J G intersects B ;= Vnoi^riPj)), with pj := then J is 

in the domain of influence Wn\H) (see [28, Definition 3.3]j of a cube H G 
with 

3?^ 

i{H)<3csr and max{sep {H, B),sep {H, J)} < <—; 

16 

(iv) E(Tj, B,,) < for every r g]|2-, 3[. 

(v) sup{dist(a;, A^j) : x G spt(Tj) {Br{pj))} < Cq every r g]|^,3[, 

where m( := niax{c(Sj)^, E(Tj, Bg^)}. 

15.2. Frequency function. For each interval of flattening we have a center manifold Aij 
and an approximating map Nj. Since each in Proposition 15.4 belongs to some interval 
of flattening [sj{k),tj{k)], we now hope to hnd suitable rescalings of the maps which 
converge to a Dir-minimizing map and which inherits the singularities of the currents , 
thus contradicting the regularity theory for Dir-minimizing functions. In other words, we 
hope to succeed in the very same program which failed in Section 11. The reason is that 
the Nj are now well-centered. In particular we claim that we have a frequency function 
estimate for the iVj’s, similar to the estimate for Dir-minimizing maps. 

Consider the following Lipschitz (piecewise linear) function 0 : [0 -|- cxd[—)■ [0, 1] given by 

fl forrG[0,i], 

0(r) ;= < 2 — 2r for r G ]|, 1], 

1^0 for r G ]1, -1-cxd[. 

For every interval of flattening L =lsi,td, let iV, be the normal approximation of T~ on 
Mj in [28, Theorem 2.4]. 


Definition 15.7 (Frequency functions). For every r g] 0,3] we dehne: 


D,(r) := 


'Mi 


dj{p) 


\DNj\‘^{p)dp and Hj(r) : = 


'Mi 


dj{p)\ \Nj\^{p) 
dj{p) 


dp, 


where dj{p) is the geodesic distance on Aij between p and ^*j(0). If Hj(r) > 0, we dehne 
the frequency function Ij(r) := . 

The following is the main analytical estimate of the paper, which allows us to exclude 
inhnite order of contact among the different sheets of a minimizing current. 


Theorem 15.8 (Main frequency estimate, cf. [29, Theorem 3.2]). If £3 is sufficiently 
small, then there exists a geometric constant Cq such that, for every [a,b] C [ 7 ^, 3 ] with 
Hj|[a,b] > 0, we have 

I,(a) <C'o(l + I,( 6 )). (109) 

Recalling the proofs of Theorem 15.8 and Theorem 9.6, the monotonicity of the corre¬ 
sponding quantity for Dir-minimizing functions rely on two identities which correspond to 
“internal” and “external” variations, cf. Proposition 9.2. In turn both variations can be 
interpreted as suitable deformations of the graphs of the functions. We then know that: 
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• The variations of the Dirichlet energy for Nj is close to the variation of the mass 
of its graph; 

• The variation of the mass of its graph is close to the variation of the mass of , 
which in turn is 0 because it is area minimizing (this must be suitably adjusted if 
the minimizing property is inside some ambient manifold S). 

We thus can write down identities which correspond to those of Proposition 9.2 but will 
include several error terms. In particular, quite careful computations give the following 
formulae, where, for the moment, we drop the subscripts j. 


Theorem 15.9 (Expansion of outer variations, cf. [27, Theorem 4.2]). If G C^(M) and 
X{p) := (p{p{p)){p - p{p)), then 


r r ^ 

6Tf{X) = U \DN\^ + V(W ® Dp) : Dk) -Q p{H, ry o iV) + V Err 
Jm^ i ^ ^ Jm , 

Erri 


where 


\Ett,\<CI \p\\a\^W 
Jm 


lErrsI <C (^|(^| (|ZliVniV| |A| + |ZliV|") + (|ZliVHiV| + |ZliV| liVlVl) 


( 110 ) 

( 111 ) 

( 112 ) 


Let T be a vector held on TAl with compact support and dehne X on U setting 
X{p) = y'(p(p)). Let be any isotopy with Tq = id and = Y and 

dehne the following isotopy of U; ^^(p) = 'L£(p(p)) + (p — p(p))- Clearly X = *h£- 


Theorem 15.10 (Expansion of inner variations, cf. [27, Theorem 4.3]). If X is as above, 
then 

6TpiX)= [ (\£^divMY-J2DN,-.{DN,-DMY))+J2^TT^, (113) 

Jm ^ ^ i J i=i 

where 

Erri = -g/ {{H,r)oN)dYVMY + {DyH, 7)0 N)), (114) 

Jm 

\Ett 2\<C [ \A\mDY\\N\‘^ + \Y\\N\\DN\), (115) 

Jm 

lErral <C f (|T||7l||ZlX|2(|X| + |Z1X|) + \DY\{\A\ \N\^\DN\ + . (116) 

Jm ^ ^ 


16. Blow-up II: Splitting before tilting and bound on the frequency 

FUNCTION 

In order to prove Theorem 15.8 we hope to achieve two important conclusions from the 
center manifold construction: 
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(A) Each time we start one of the center manifolds Mj, the initial value of the frequency 
function, say Ij(l), is bounded a-priori independently of j; 

(B) After applying the first variations, with the help of Theorem 15.9 and Theorem 
15.10 we hope to derive an inequality of the form 

^Mt) > -E(r)l,{r ), 

which in turn gives the monotonicity of Ij(r)e-^o E(T)dT^ 

Both points are the effect of a phenomenon which we call “splitting before tilting” inspired 
by Riviere’s important paper [53]. 

16.1. The splitting before tilting phenomenon. Fix a center manifold Ai (we drop 
the subscript to simplify the notation) and consider again the algorithm dehning the cor¬ 
responding Whitney decomposition, as in Dehnition 12.6. Ignoring for the moment the 
special cubes L which stopped because of the condition (NN), we must remember that all 
other cubes must have stopped for one of the following two very precise reasons: 

(h) either the height in exceeds 
(e) or the excess in exceeds 

In the hrst occasion we use an appropriate “layered version” of the classical height bound 
for currents with small excess. In order to avoid too many technicalities, we state the 
bound in a rather informal way. 

Theorem 16.1 (Layered height bound, cf. [28, Theorem A.lj). Let T satisfy Assumption 
10.7 in the cylinder C4,.(0,7ro) and let E := E(T, 04 ^( 0 , tto). In a slightly smaller cylinder 
Cp(0,7ro) the current is then supported in k < Q disjoint layers of the form Bp{0,no) x 
BcpEy^"'{yi:^o)' layer the density 0(-,T) does not exceed a certain integer Qi by 

much and Qi = Q. 

In particular, if Ch is relatively large compared to Ce, at any cube L where the rehning 
procedure has stopped by the (HT) condition the current will form at least two separate 
“layers” of thickness smaller than f(L)^+(i“^ 2 )/m parallel to ttl. The two layers are 
separated by a distance comparable to the height in B^,, say Thus, 

anywhere in a region of diameter Ci{L) which includes C (where C will depend on the 
constant Mg) we can expect \N\ to be at least as large as Thus, on 

every region 12 of measure ci{L)'^ close to C the size of |iVp is at least We 

refer to reader to [28, Proposition 3.1] (see also [62, Section 5.5]) for the precise statement. 

In the second case we would like to say that |Z2iVp is at least fQ^ g^y 

region 12 which is a ball in M of radius ci{L), sufficiently close to C. This is true but much 
more subtle. 

Recall that, if H is the father of L, the excess in B// is smaller than CeTnQi{HY~'^^^. If 
the parameter Mq is chosen very large, B^ and B/^- are almost concentric and the radius 
of Bi is twice the radius of B//. On the other hand we know that, in B/,, the current T 
can be approximated by a Dir-minimizing Q-valued map. If the latter were too close to 
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a multiple copy (with multiplicity Q) of a classical single valued harmonic function, then 
the argument illustrated in Section 4.1 tells us that the excess in should be almost j 
of that in cf. Remark 4.3. But this is not the case because the ratio is instead at 
least although small, the parameter 62 makes here a big difference! Thus in B^ 

T is close to a “non-classical” Dir-minimizer / = |/j], more precisely we can certainly 

assume that the Dirichlet energy of / = /*! is not too small compared to that 

of /. For such maps we have the important property that in no region of their domain of 
dehnition can the energy become too small. Passing now to the “curvilinear coordinates” 
we can infer the same conclusion for \DN\^ whenever is a region of a suitable size 
sufficiently close to C. For the precise statement we refer the reader to [28, Proposition 
3.4] (see also [62, Section 5.5]). 

Finally, it is necessary to achieve some contol upon the behavior of N in Whitney regions 
related to cubes where the rehning procedure stopped because of the (NN) conditions. 
However, it is not difficult to see that these cubes can be partitioned into families, each 
family being sufficiently close to some cube of type (HH) or (EX). In fact it turns out that 
only the second condition might really occur, cf. [28, Corollary 3.2]. The idea is simple: if 
L is an (NN) stopping cube, then it has a neighbor twice as big where the rehning procedure 
stopped as well. We can iterate this consideration until we hnd a cube of type (HT) or 
(EX). This cube might be of type (EX): the separation condition of type (HT) propagates 
to nearby cubes of half size by the layered height bound of Theorem 16.1. Consider now 
the union U of the Whitney regions relative to all the cubes (NN) which lead to the same 
cube L of type (EX) via the above procedure. Then the size of U is comparable to that of 
the Whitney region C and thus we can bound all the error terms in U with the Dirichlet 
energy of N in C. 

16.2. Uniform bound on Ij(l). If the intervals of flattening are hnite, then there would 
be nothing to prove. What happens however if the intervals are not finite? Consider some 
“starting point” tj. It essentially can be of two kinds: 

• = U-i) center manifold M.j-i had ceased to be “good” at scale 

and we encounter a cube L G which is of size Sj-ijtj-x where the rehning 

procedure has stopped. It is the hrst scale at which we encounter a large cube close 
to 0: for this very precise reason it is not difficult to show that it must necessarily 
be a cube where either the excess or the height condition fails, cf. Proposition 15.6. 
But then the discussion in Section 16.1 implies that the approximating map iVj_i 
is well separated at this scale: this means that the denominator of the freqnency 
function Ij_i is not too small compared to the nnmerator. It remains to transfer 
this information to the freqnency fnnction ly note however that at the scale which 
interests us there is a large overlap between the graphs of (snitable rescalings of) 
Nj and Nj-\ since they both approximate the same piece of the same current. 

• tj < Sj-i; in this case the excess of is not small enough to construct a center 
manifold, tj is then the first radius at which the excess goes below the desired 
threshold. As such we also know that E(T, B^^.) cannot be too small, i.e. the 
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current cannot be too close to a flat plain at the scale tj. But then we can expect 
that the denominator Hj(l) is not too small compared to the numerator 0^(1). 

This discussion can be made rigorous and we refer the reader to [29, Theorem 5.1] for the 
details. 


17. Blow-up III: einal arguments 


We have now gained a sequence of center manifolds and of approximations N^k), 

so that Vk belongs to ]si(k),ti(k)], where {r^} is the sequence of Proposition 15.4. Let us 
rescale the center manifolds by a factor rk/ti(^k), so to gain a sequence of manifolds Mk 
and maps Nk at “scale 1”, which should give rather good approximations of Tk := To,rfc- 
We £x for convenience a “central point” for each center manifold Aii: it will be the point 
Pi lying in the plane {0} x M”. Correspondingly pk := pi{k)ti{k)/rk is the central point of 
Mk- The geodesic balls with center q and radius p will be denoted by Bp{q) in any of these 
manifolds. We next normalize further the maps Nk dividing them by their norm”, 
namely by 


hfc ;= 



(117) 


i.e. we set 



(118) 


Observe that must be positive: by the discussion in Section 16.1, it can vanish only 
if jg empty. However in the latter case would coincide with Q copies of the 

(smooth) manifold which cannot be the case because 0 is a singular point. 


We wish to take a limit for (a suitable subsequence of) iV|: since M.k “flattens” (i.e. 
converges to ttq) we hope that the limit is a Q-valued map which has norm equal 
to 1, because the convergence is strong in Such strong convergence will be achieved if 
we could prove that the rescaled maps have bounded Dirichlet energy, namely if we had 
an inequality of type 

[ \DN^^\^<C, (119) 

JbPpu) 

for some constant C which does not depend on k. In turn this bound corresponds to a 
“reverse Sobolev” inequality for the Npk)) he. 




\DNpk)?<Cr 


-2 


'Br{pi(k)) 



On the other hand this is precisely the bound that a bound on the frequency function as 
in Theorem 15.8 is expected to yield. For the details we refer the reader to [29, Corollary 
5.3]. 
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17.1. The final map We have now finally gained our “blown-up” map N^, which 
is the limit (up to subsequences) of the maps N^, and we know that it is a Q-valued map 
on i?i(0,7ro). Note that the estimate (94) (and the lower bounds discussed in Section 16.1 
deliver the extra information that t] o = 0. This also helps us in the Taylor expansion 
of the area functional to conclude that is Dir-minimizing, cf. Theorem 14.6 (we are 
of course ignoring the complications given by the ambient Riemannian manifold S). We 
now wish to succeed where the strategy outlined in Section 11 failed, i.e. in showing that 
the blown-up map must “remember” the singularities of the rescaled currents. Note 
however that we just need to show that the lower bound 

wr^+“(BinSiiigo(r„,,.))>,,>0 

induces a similar lower bound on the Hausdorff measure of the set of points p where 
NLip) = Qh oiV^(p)l. Indeed, from such a lower bound and Theorem 8.4 we would 
conclude that is a classical single valued harmonic function counted Q times. On the 
other hand r} o = 0 would then imply = Q |0]. This would then finally contradict 
what we concluded from the previous section, namely that 

/ = 

Hence, consider the closed sets O Bi) and let T be their Hausdorff limit (after 

extraction of a subsequence). We wish to show that most points in F are points of “mul¬ 
tiplicity Q” for iV^, or briefly Q-points for iV^. 

17.2. Persistency of singularities. Recall that is continuous (because it is Dir- 
minimizing). Thus a statement analogous to Theorem 11.3 would guarantee that all points 
of r are Q-points for N^. However such a theorem is not available at present. We rather 
show that the subset of points of F which fail to be Q-points of must be a set of 
'H'"“^’i'“-measure, which however is enough for our purposes. We finish therefore this 
survey by sketching the argument. 

Indeed we will follow a slightly different strategy. Summarizing what achieved so far, 
is a nontrivial Dir-minimizing map which has r} o = 0. Thus the set of points of 
multiplicity Q for coincides with the closed set of p where N^{p) = Q |0]. Such set 
must have j^ieasure zero. We can then identify a closed set A for which, at the 

same time, we have 

• For some positive -d 


j — rj o N^\‘^ > 2-d everywhere on A 


• A is the Hausdorff limit of closed subsets A*, of with 'H™' measure 

larger than 

By the Holder continuity of there must be a fixed constant a such that 



> d > 0 


Vq E A ,ya < a. 
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We now fix a positive a < a, whose choice will be specihed only at the very end. 
By convergence, for k large we inherit the inequality 




\Nk,i -rjo Nk\^ >^hl>0 


Vg e PAi.(Afc), 


( 120 ) 


where denotes the orthogonal projection onto Mk- Now observe that, for k large 
enough, it is also true that for any q G p_A;(j,(Afc) there is a point p G DQ(TQ^t^^^^) which is 
in the proximity of q, at a scale much smaller than a. A very favorable situation is when 
q belongs to the contact set where coincides with Q (and thus p = q)'. in this 

case 

lim/ Y \^k,i -r}o Nk? = 0 

On the other hand, even if this is not the case, it is possible to argue that for some radius 
p « a the integral above goes below the threshold |h|, namely 



2 

< jK 


( 121 ) 


Now, the reader familiar with Morrey spaces will realize that (120) and (121) force the 
existence of some intermediate radius t{q) G]p, a[ with 


cpd 


K< 


1 

t(g)™-2+« 



( 122 ) 


where cp is a universal constant. 

For each p G A^ let t{p) be t{pj<^^{q)). Now using an elementary covering argument 
we can cover A^ with balls Bioppq(pj) in such a way that the balls B 2 t{pi){Pi) are disjoint. 
Being Adfc almost flat it is not difficult to see that even the balls Bpqp^qi) := 'Bt(pi)(PA(j,(Pi)) 
must be disjoint. Since A^ has j^ieasure larger than p/2, we achieve 

5 < Vi..„-2+„(10«(*))”-^+” < V ^ / \DNp 

^ i i ■'».(,.)(«) 

Crr'^ P 

/ \DNk\^<Ca^. (123) 

% JBiipk) 

In the last inequality we have used that h^, the norm of N^, controls the Dirichlet 
energy, recall (117), (118) and (119). Although the constant C in (123) depends upon 
the sequence of rescalings, the current T and several other parameters, it does not depend 
upon a and p. Thus for a suitable choice of a we reach a contradiction. 
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